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Opinions,  interpretations,  conclusions  and  recommendations  are 
those  of  the  author  and  are  not  necessarily  endorsed  by  the  U.S. 
Army. 

\/  vfhere  copyrighted  material  is  quoted,  permission  has  been 
obtained  to  use  such  material. 

material  from  documents  designated  for  limited 
'  distribution  is  quoted,  permission  has  been  obtained  to  use  the 
material . 

V  citations  of  commercial  organizations  and  trade  names  in 
this  report  do  not  constitute  an  official  Department  of  Army 
endorsement  or  approval  of  the  products  or  services  of  these 
organizations. 

In  conducting  research  using  animals,  the  investigator (s) 
adhered  to  the  "Guide  for  the  Care  and  Use  of  Laboratory 
Animals,"  prepared  by  the  Committee  on  Care  and  use  of  Laboratory 
Animals  of  the  Institute  of  Laboratory  Resources,  national 
Research  Council  (NIH  Publication  No.  86-23,  Revised  1985). 

For  the  protection  of  human  subjects,  the  investigator (s) 
^dh^red  to  policies  of  applicable  Federal  Law  45  CFR  46. 

_  In  conducting  research  utilizing  recombinant  DNA  technology, 

^e  investigator (s)  adhered  to  current  guidelines  promulgated  by 
the  National  Institutes  of  Health. 

In  the  conduct  of  research  utilizing  recombinant  DNA,  the 
instigator (s)  adhered  to  the  NIH  Guidelines  for  Research 
Involving  Recombinant  DNA  Molecules . 

In  the  conduct  of  research  involving  hazardous  organisms, 
n'investigator(s)  adhered  to  the  CDC-NIH  Guide  for  Biosafety  in 
jlicrobiological  and  Biomedical  Laboratories . 
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Progress  Report  for  the  DOD  Grant  DAMD 17-98-1-8042 

April  1998-April  1999  (first  year  of  support) 


During  the  first  year  of  support  by  the  grant  DAMD17-98-1-8042  (the  Idea  Grant)  we 
focused  on  several  projects  whose  current  status  is  described  below. 

Development  of  sitoxins  (signal-regulated,  cleavage-mediated  toxins) 

One  problem  with  the  earlier  design,  described  in  the  Idea  Grant  proposal,  was  a 
significant  toxicity  of  Cup9p,  which  was  used  as  a  portable  degradation  signal  (degron).  We 
addressed  this  problem  by  constructing  a  mutant  of  Cup9p  that  does  not  bind  to  its  high-affinity 
sites  in  the  S.  cerevisiae  DNA,  thereby  greatly  reducing  the  toxicity  of  Cup9p.  Specifically,  a 
missense  mutation  was  introduced  into  the  alpha-helical  region  of  Cup9p  that  is  known  to 
contact  the  major  groove  of  DNA  and  contribute  to  the  high-affinity  binding  by  Cup9p.  This 
mutation  retained  the  alpha-helix  of  Cup9p,  did  not  interfere  with  its  overall  folding,  but 
prevented  the  high-affinity  binding  of  Cup9p  to  DNA.  The  resulting  mutant  Cup9p  was  still 
short-lived  in  vivo,  but  was  no  longer  toxic,  even  at  high  expression  levels.  Further  work  on 
sitoxins  using  this  mutant  is  under  way. 

Construction  of  degradation  signals  in  the  lysine-asparagine  sequence  space 

Among  the  problems  to  be  solved  on  the  way  to  multitarget  conditional  toxins  is  the 
paucity  of  strong,  portable  N-degrons  that  are  free  of  drawbacks  of  the  existing  degrons  of  this 
class.  These  drawbacks  include  the  tendency  of  proteins  bearing  such  degrons  to  undergo 
endoproteolytic  cleavages  that  sever  the  N-degron  from  the  rest  of  the  protein,  and  thereby 
contribute  to  the  high  background  of  assays  with  N-degron-bearing  proteins.  As  a  part  of  our 
ongoing  effort  to  construct  better  degradation  signals,  we  developed  a  new  approach,  which 
makes  it  possible  to  identify  and  isolate  specific  degradation  signals  in  the  sequence  space  of  just 
two  amino  acids,  lysine  and  asparagine  A  paper  describing  these  results  is  being  written  up..  Our 
findings  are  briefly  summarized  below. 

The  N-degrons,  a  subset  of  degradation  signals  recognized  by  the  N-end  rule  pathway, 
comprise  a  protein’s  destabilizing  N-terminal  residue  and  an  internal  lysine  residue.  We  showed 
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that  the  strength  of  active  N-degron  can  be  markedly  increased,  without  loss  of  specificity, 
through  the  addition  of  lysine  residues.  To  explore  constraints  on  the  structure  of  N-degrons,  we 
took  advantage  of  a  single  third-letter  difference  between  the  codons  for  lysine  and  asparagine. 

A  nearly  exhaustive  screen  was  carried  out  for  N-degrons  in  the  lysine-asparagine  (K/N) 
sequence  space  of  the  14-residue  K/N  regions  (16,384  different  sequences).  68  of  these 
sequences  were  found  to  function  as  N-degrons,  and  3  of  them  were  at  least  as  active  as  any  of 
the  previously  known  N-degrons.  All  K/N-based  N-degrons  lacked  the  lysine  at  position  2,  and 
all  three  of  the  strongest  N-degrons  contained  lysines  at  positions  3  and  15.  Our  findings  support 
the  model  of  the  targeting  mechanism  in  which  the  binding  of  the  E3-E2  complex  to  the 
substrate’s  destabilizing  N-terminal  residue  is  followed  by  a  stochastic,  time-restricted  search  for 
a  sterically  suitable  lysine  residue.  The  strategy  of  screening  a  small  library  that  encompasses 
the  entire  sequence  space  of  two  amino  acids  should  be  of  use  in  many  settings,  including  studies 
of  protein  targeting  and  folding.  A  paper  describing  these  results  is  being  written  UB. 

We  plan  to  employ  the  lysine-asparagine  N-degron  discovered  in  the  present  work  in 
both  the  designs  of  sitoxins  above,  and  in  the  trans- targeting  screens  described  below. 

Bivalent  Inhibitor  of  Protein  Degradation  by  the  N-End  Rule  Pathway 

In  a  project  relevant  to  the  theme  of  this  grant,  we  have  developed  a  specific  bivalent 
inhibitor  of  the  N-end  rule  pathway.  The  relevance  of  this  advance  stems  from  the  fact  the 
bivalent  inhibitor  (described  below)  improved  the  understanding  of  the  targeting  (substrate¬ 
binding)  sites  of  N-recognin,  the  E3  component  of  the  ubiquitin-dependent  N-end  rule  pathway, 
which  is  at  the  center  of  our  sitoxins  and  comtoxin  projects.  Eventually,  the  designs  of 
conditional  toxins  should  be  able  to  involve  any  of  the  several  ubiquitin-dependent  proteolytic 
pathways;  the  current  focus  on  the  N-end  rule  pathway  reflects  not  only  the  interest  of  this 
laboratory  but  also  the  fact  that  this  pathway  is  particularly  well  understood  mechanistically. 
Moreover,  in  staying  within  the  N-end  rule  pathway,  it  is  possible  to  use  the  portable,  highly 
active  N-degrons,  an  advantage  that  is  not  available  with  most  of  the  other  ubiquitin-dependent 
pathways. 

The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-terminal 
residue.  Ubrlp,  the  recognition  (E3)  component  of  the  Saccharomyces  cerevisiae  N-end  rule 
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pathway,  contains  at  least  two  substrate-binding  sites.  The  type  1  site  is  specific  for  N-terminal 
basic  residues  Arg,  Lys,  and  His.  The  type  2  site  is  specific  for  N-terminal  bulky  hydrophobic 
residues  Phe,  Leu,  Trp,  Tyr,  and  lie.  Previous  work  has  shown  that  dipeptides  bearing  either 
type  1  or  t5^e  2  N-terminal  residues  act  as  weak  but  specific  inhibitors  of  the  N-end  rule 
pathway.  We  took  advantage  of  the  two-site  architecture  of  Ubrlp  to  explore  the  feasibility  of 
bivalent  N-end  rule  inhibitors,  whose  expected  higher  efficacy  would  result  fi*om  higher  affinity 
of  the  cooperative  (bivalent)  binding  to  Ubrlp.  The  inhibitor  comprised  mixed  tetramers  of 
P-galactosidase  (Pgal)  that  bore  both  N-terminal  Arg  (type  1  residue)  and  N-terminal  Leu 
(type  2  residue)  but  were  resistant  to  proteolysis.  Expression  of  these  constructs  in  S.  cerevisiae 
inhibited  the  N-end  rule  pathway  much  more  strongly  than  the  expression  of  otherwise  identical 
Pgal  tetramers  whose  N-terminal  residues  were  exclusively  Arg  or  exclusively  Leu.  Our  findings 
are  the  first  evidence  that  the  type  1  and  type  2  sites  of  N-recognin  are  spatially  proximal  in  the 
225  kD  5.  cerevisiae  Ubrlp.  In  addition,  our  results  strongly  suggest  that  small  bivalent 
inhibitors  of  the  N-end  rule  pathway  are  feasible,  and  moreover,  are  expected  to  be  much  more 
potent  than  their  monovalent  counterparts.  Work  to  produce  such  inhibitors  is  under  way.  A 
paper  describing  these  findings  is  being  written  up. 

Construction  and  Use  of  the  Library  Expressing  Random  Fragments  of  the 
Green  Fluorescent  Protein  (GFP)  and  Sre  Protein  Kinase 

The  aim  of  this  ongoing  comtoxin  project  is  to  develop  a  generally  applicable  method  for 
interfering  with  the  in  vivo  function  of  a  target  protein  of  interest  through  the  expression  of  a 
specific  fragment  of  the  target  that  would  bind  to  a  conformationally  immature  (nascent)  target 
protein  and  thereby  either  arrest  or  strongly  retard  its  conformational  maturation.  As  a  result,  the 
complex  of  the  target  and  its  self-peptide  would  be  nonfunctional  insofar  as  the  target’s  function 
is  concerned.  In  addition,  the  interfering  peptide  would  be  equipped  with  a  strong  N-termmal 
degradation  signal  (the  N-degron,  previously  studied  by  this  laboratory).  A  permanently  or 
transiently  stable  complex  between  a  target  protein  and  its  self-peptide  bearing  an  N-degron 
would  be  targeted  for  processive  degradation  by  the  N-end  rule  pathway,  one  of  the  pathways  of 
the  ubiquitin  system.  Once  developed,  this  method  could  be  applied  to  effect  selective  functional 
inhibition  of  undesirable  proteins  such  as,  for  example,  overexpressed  oncoproteins.  This 
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trans-degradation  method  would  also  become  a  part  of  the  comtoxin  approach  described  in  the 
DOD  Idea  Grant  that  supports  these  studies. 

To  develop  the  trans-degradation  technique,  we  chose  Green  Fluorescent  Protein  (GFP) 
as  a  model  target  protein.  (Our  next,  clinically  relevant,  target  is  Src,  a  protein  tyrosine  kinase 
and  an  proto-oncoprotein  activated  in  a  number  of  human  cancers,  including  breast  cancer.)  We 
carried  out  the  construction  of  a  library  of  clones  which  expressed  polypeptide  fragments  derived 
from  random  segments  of  GFP.  These  fragments  were  expressed  in  the  yeast  5.  cerevisiae  from 
the  Pqal  promoter  as  linear  fusions  to  ubiquitin.  As  shown  previously  by  this  laboratory,  the 
N-terminal  ubiquitin  moiety  of  the  fusion  is  cotranslationally  cleaved  off  by  ubiquitin-specific 
proteases,  making  it  possible  to  expose  any  desired  amino  acid  residue  at  the  N-terminus  of  the 
resulting  polypeptide  fragment.  In  developing  this  approach,  we  are  proceeding  in  steps,  at  first 
constructing  a  ubiquitin  fusion-based  library  of  GFP  fragments  as  such.  The  second-generation 
library  will  have  the  same  fragments  bearing  the  40-residue  N-terminal  extension  called  Arg- 
e^*^,  which  contains  the  above-described  N-degron. 

The  library  expressing  random  GFP  fragments  was  constructed  using  DNase  I  in  the 
presence  of  Mn2+  ions  to  fragment  the  GFP  cDNA,  and  specific  oligonucleotide  adapters  to 
insert  the  resulting  fragments  into  the  expression  vector.  More  recently,  we  began  the 
construction  of  analogous  libraries  with  v-Src  and  c-Src  trans-targeting,  degron-containing 
reporter.  The  v-Src  and  c-Src  cDNAs  were  obtained  from  Dr.  D.  Morgan  (Univ.  of  California, 
Berkeley).  They  were  tested  by  sequencing  the  5’-  and  3 ’-ends  (-200  bp  each)  of  the  coding 
regions,  and  confirming  that  the  resulting  sequences  matched  those  for  v-Src  and  c-Src  in  the 
GenBank  database.  Restriction  fragments  containing  the  Src  genes  were  isolated  by  agarose  gel 
electrophoresis,  and  are  being  used  to  construct  random-fragment  libraries  as  described  above. 
Taking  advantage  of  the  unique  Nco  I  site  at  the  5’-end  the  v-Src  cDNA,  the  v-Src  coding  region 
(as  a  filled-in  Nco  I-Xho  I  fragment)  was  inserted  into  the  integration  vector  pIP123-CUPR. 

This  placed  the  Src  ORF  under  the  control  of  the  copper-inducible  Pcupi  promoter.  It  remains  to 
be  determined,  in  this  ongoing  project,  whether  this  promoter  is  sufficiently  tight  (low  enough 
level  of  Src  in  the  uninduced  state)  and  sufficiently  strong  (high  enough  level  of  Src  in  the 
induced  state  to  cause  growth  arrest).  (In  contrast  to  the  GFP-based  library,  where  the  screen  is 
being  carried  out  by  fluorescence-activated  cell  sorting  (FACS),  our  readout  and  the  screen  with 
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Src  are  based  on  the  known  toxicity  of  mammalian  Src  to  the  yeast  S.  cerevisiae.)  Examination 
of  the  three-dimensional  structures  of  Src  family  tyrosine  kinases  shows  that  a  loop  connecting 
the  SH2  domain  to  the  kinase  domain  is  bound  to  the  SH3  domain  of  Src  and  sandwiched 
between  the  SH3  domain  and  the  N-terminal  half  of  the  kinase  when  Src  is  in  the  inactive  state. 
The  kinase-inhibiting  interaction  requires  phosphorylation  of  Src  at  its  C-terminal  region,  itself 
bound  to  the  SH2  domain.  This  area  is  a  promising  candidate  for  a  trans-targeting  N-degron. 

Our  nearest  aims  in  this  set  of  projects  are,  first,  to  optimize  the  library-based  interference 
approach  described  above,  using  GFP  as  both  the  target  and  reporter,  then  to  expand  this 
approach  to  the  Src  kinase,  and  to  link  an  N-degron  to  the  interfering  fragments  identified  in  both 
the  GFP  and  Src  library  screen.  This  multistep  strategy  should  result  in  a  generally  applicable 
method  for  metabolically  destabilizing  (and  thereby  functionally  inhibiting)  any  protein  of 
interest,  including  the  targets  that  must  be  down-regulated  for  either  arresting  or  selectively 
killing  the  breast  adenocarcinoma  cells. 


Enclosed  with  this  Progress  Report  are  our  recently  published  papers  (1998-99;  one  set), 
and  the  CV  of  PI.  The  two  manuscripts-in-preparation  mentioned  above  should  be  finished  in 
less  than  2  months.  The  Progress  Report  describes  projects  that  were  supported,  either  partially 
or  entirely,  by  the  DOD  Idea  Grant  during  its  first  year. 
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Analysis  of  a  conditional  degradation  signal  in  yeast  and  mammalian  cells 

Frederic  Levy*,  Jennifer  A.  Johnstont  and  Alexander  Varshavsky 

Division  of  Biology,  California  Institute  of  Technology,  Pasadena,  CA,  USA 


The  N-end  rule  pathway  is  a  ubiquitin-dependent  proteolytic  system,  the  targets  of  which  include  proteins  that  bear 
destabilizing  N-terminal  residues.  The  latter  are  a  part  of  the  degradation  signal  called  the  N-degron.  Arg-DHFR‘\ 
an  engineered  N-end  rule  substrate,  bears  N-terminal  arginine  (a  destabilizing  residue)  and  DHFR''  [a  temperature- 
sensitive  mouse  dihydrofolate  reductase  (DHFR)  moiety].  Previous  work  has  shown  that  Arg-DHFR"'  is  Iona-lived 
at  23  “C  but  short-lived  at  37  °C  in  the  yeast  Sacchawmyces  cerevisiae.  In  the  present  work,  we  extended  this 
analysis,  and  found  that  the  degradation  of  Arg-DHFR"  can  be  nearly  completely  inhibited  in  vivo  by  methotrexate 
(MTX),  a  low-A/r  ligand  of  DHFR.  In  S.  cerevisiae,  Arg-DHFR''  is  degraded  at  37  “C  exclusively  by  the  N-end  rule 
pathway,  whereas  in  mouse  cells  the  same  protein  at  the  same  temperature  is  also  targeted  by  another  proteolytic 
system,  through  a  degron  in  the  conformationally  perturbed  DHFR''  moiety.  In  mouse  cells.  MTX  completely 
inhibits  the  degradation  of  Arg-DHFR''  through  its  degron  within  the  DHFR''  moiety,  but  only  partially  inhibits 
degradation  through  the  N-degron.  When  the  N-terminus  of  Arg-DHFR''  was  extended  with  a  42-residue  lysine- 
lacking  extension,  termed  e-^*^.  the  resulting  Arg-e'^’^-DHFR''  was  rapidly  dearaded  at  both  23  “C  and  37  °C 
Moreover,  the  degradation  of  Arg-e'^'^-DHFR''.  in  contrast  with  that  of  Arg-DHFR'*'.  could  not  be  inhibited  by  MTX 
suggesting  that  the  metabolic  stability  of  Arg-DHFR''  at  23  T  results,  at  least  in  part,  from  steric  inaccessibility  of 
Its  N-terminal  arginine.  The  N-degron  of  Arg-DHFR*'  is  the  first  example  of  a  portable  degradation  signal  the 
activity  of  which  can  be  modulated  in  vivo  by  a  cell-penetrating  compound.  We  discuss  implications  of  this  advance 
and  the  mechanics  of  targeting  by  the  ubiquitin  system. 

Keywords:  degron;  methotrexate;  N-end  rule;  proteolysis;  ubiquitin. 


A  number  of  regulatory  circuits,  including  those  that  control  the 
cell  cycle,  cell  differentiation  and  responses  to  stress,  involve 
metabolically  unstable  proteins  [1-6].  A  short  in  vivo  half-life 
of  a  regulator  provides  a  way  to  generate  its  spatial  gradient  and 
allows  rapid  adjustments  of  its  concentration,  or  subunit 
composition,  through  changes  in  the  rate  of  its  synthesis.  A 
protein  can  also  be  conditionally  unstable,  i.e.  long-lived  or 
short-lived,  depending  on  the  state  of  the  cell.  One  example  of 
the  latter  class  are  cyclins,  a  family  of  related  proteins  the 
destruction  of  which  at  specific  stages  of  the  cell  cycle  regulates 
cell  division  and  growth  [3,7]. 

Features  of  proteins  that  confer  metabolic  instability  are  called 
degradation  signals,  or  degrons  [8].  An  essential  component  of 
one  degradation  signal,  called  the  N-degron,  is  a  destabilizing 
N-terminal  residue  of  a  protein  [9,10].  A  set  of  N-degrons 
containing  different  N-terminal  residues  which  are  destabilizing 
in  a  given  cell  yields  a  rule,  termed  the  N-end  rule,  which  relates 
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the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-terminal 
residue.  In  eukaryotes,  the  N-end  rule  pathway  is  a  part  of  the 
ubiquitin  (Ub)  system.  Ub  is  a  76-residue  protein  whose 
covalent  conjugation  to  other  proteins  plays  a  role  in  a  multitude 
of  processes,  including  cell  growth,  differentiation  and  responses 
to  stress  [2,4,1 1-13].  In  many  of  these  settings,  Ub  acts  through 
routes  that  involve  the  degradation  of  Ub— protein  conjugates  by 
the  26S  proteasome,  an  ATP-dependent  multisubunit  protease 
[14,15]. 

In  eukaryotes,  linear  Ub-protein  fusions  are  rapidly  cleaved 
by  Ub-specific  proteases  (UBPs)  at  the  Ub-protein  junction, 
making  possible  the  production  of  otherwise  identical  proteins 
bearing  different  N-terminal  residues,  a  technical  advance  that 
led  to  the  discovery  of  the  N-end  rule  [9.10].  When  mouse 
dihydrofolate  reductase  (DHFR),  a  20-kDa  monomeric  protein, 
was  expressed  as  a  Ub-Arg-DHFR  fusion  in  the  yeast  Sac- 
charomyces  cerevisiae  at  30  °C,  the  resulting  Arg-DHFR  (pro¬ 
duced  through  deubiquitylation)  was  long-lived  (/0.5  >  4  h), 
even  though  it  bore  N-terminal  Arg,  a  destabilizing  residue  [16]. 
By  contrast,  a  modified  protein,  Arg-e*^-DHFR  (produced  from 
Ub-Arg-e*^-DHFR),  which  bore  a  42-residue,  lysine-containing 
extension  (denoted  as  e*^)  between  the  N-terminal  Arg  residue 
and  the  first  (Val)  residue  of  DHFR,  was  short-lived  in  vivo  (to^ 
of  =»  10  min),  being  degraded  by  the  N-end  rule  pathway  [16]. 

This  and  other  evidence  indicated  that  the  N-degron  com¬ 
prises  two  essential  determinants:  a  destabilizing  N-terminal 
residue  and  an  internal  Lys  residue  (or  residues)  of  a  substrate. 
The  Lys  residue  is  the  site  of  attachment  of  a  multi-Ub  chain 
[16-18].  Arg-e’^-DHFR  contains  a  complete  N-degron  because 
it  bears  both  an  N-terminal  destabilizing  residue  (Arg)  and,  in 
the  e  extension,  at  least  one  targetable  Lys  residue.  In  contrast, 
the  N-degron  of  Arg-DHFR  is  incomplete,  for  at  least  one  of  the 
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two  mutually  non-exclusive  reasons;  the  absence  of  Lys  residues 
accessible  to  the  targeting  complex  of  the  N-end  rule  pathway 
and/or  poor  accessibility  of  the  N-terminal  Arg  in  Arg-DHFR  to 
the  same  targeting  complex.  Since  the  20-kba  mouse  DHFR 
contains  16  Lys  residues,  this  interpretation  presumes  that,  in  a 
folded  DHFR  molecule,  the  lysines  are  ineffective  as  ubiquity- 
lation  sites,  becau.se  of  the  lysine’s  lack  of  mobility  and/or  its 
di.stance  from  a  destabilizing  N-terminal  residue  [lOJ. 

Previous  work  |I9]  described  a  temperature-sensitive  Us) 
allele  of  DHFR  that  was  used  to  construct  a  heat-inducible 
N-degron.  in  a  Ub-Arg-DHFR'''-Ura3p  fusion  which  contained 
5.  cerevisiae  Ura3p  as  a  C-terminal  reporter.  The  Ub-Arg- 
DHFR'-  moiety  of  this  fusion  was  identical  with  Ub-Arg-DHFR 
above,  except  for  a  Pro  — »  Leu  alteration  at  position  66  of  the 
DHFR  moiety.  Arg-DHFR'''-Ura3p  was  long-lived  at  23  °C  but 
short-lived  at  37  °C  in  yeast.  Moreover,  the  Ub-Arg-DHFR'" 
module  of  Ub-Arg-DHFR''-Ura3p  was  shown  to  be  portable,  in 
that  linking  it  to  a  protein  of  interest  conferred  heat-inducib|e 
metabolic  in.stability  on  the  entire  fusion  |I9]. 

In  addition  to  yielding  a  new  method  for  the  construction  of  ts 
mutants,  the  heat-inducible  N-degron  [19]  provided  an  approach 
to  analyzing  the  mechanism  of  targeting  by  the  N-end  rule 
pathway.  In  carrying  out  this  analysis,  described  below,  we  have 
found  that  the  degradation  of  Arg-DHFR'"  can  be  inhibited  in 
vivo  by  methotrexate  (MTX),  a  low-M^  ligand  of  DHFR.  We 
also  show  that,  in  mouse  cells.  Arg-DHFR'"  is  targeted  by  both 
the  N-end  rule  pathway  and  another  proteolytic  .system,  whereas 
in  yeast  the  same  te.st  protein  is  exclusively  an  N-end  rule 
sub.strate.  The  N-degron  of  Arg-DHFR'"  is  the  first  example  of  a 
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portable  degradation  .signal  who.se  activity  can  be  modulated  in 
vivo  by  a  cell-penetrating  compound.  ■■  , 

-  y  ' 

• 

MATERIALS  AND  METHODS  ^  , 

Plasmids  for  expression  in  S.  cerevisiae 

The  plasmid  pJJRtd,  which  expressed  Ub-Arg-DHPR’^-ha 
(Fig.  1.  construct  I),  was  based  on  the  vector  pRS3I6  [20].  A 
1.0-kb  EcoRMHindUl  fragment  of  pPW58  [8]  was  ligated  into 
pRS3i6,  followed  by  the  cloning  of  the  Pci’p/  promoter- 
containing  0.4  kb  EcoRUSacl  fragment  of  pPW48  [19]  into  the, 
Err^RI  site,  a  step  that  yielded  pJJRtd.  pJJMtd,  which  expressed 
Ub-Met-DHFR‘''-ha,  was  prepared  by  ligating  the  large 
AflU  fragment  of  pJJRtd  to  a  small  fragment  from  the  A^el/AflW 
digest  of  pLGUbM-DHFRha  [21],  " 

Plasmids  for  expression  in  mammalian  cells 

Mouse  DHFR  and  DHFR*''  open  reading  frames  bearing  the 
desired  restriction  sites  at  their  5'  and  3"ends  were  produced 
using  PCR.  All  DHFR  and  DHFR*''  moieties  were  flanked,  at 
their  C-termini,  with  the  ha  epitope  |22]  (Fig.  1),  and  were 
cloned  into  the  pRc/CMV  vector  (Invitrocen.  San  Diego,  CA 
USA).  ^ 

Constructs  II  and  III  (Fig.  1)  were  produced  by  PCR 
amplification  of  a  fragment  encoding  Met-DHFR*''  and  Arg- 
DHFR*\  respectively,  using  construct  I  as  a  template.  The  N- 
terminal  Met  of  construct  II  was  specified  by  the  PCR  primer 


Fig.  1.  Test  proteins.  Fusion.s  used  in  this  work 
contained  .some  of  the  following  moieties:  the  Ub 
moiety,  either  N-terminal  (con.structs  I,  IV-VII) 
or  placed  between  other  moieties  (constructs  II 
and  III):  an  Arg  residue  at  the  junction  between 
Ub  and  the  C-terminal  part  of  the  fusion 
(constructs  I,  III,  V  and  VII):  a  Met  residue  at  the 
same  junction  (constructs  II.  IV,  and  VI):  a  42- 
residue  E.  coU  Lac  repressor-derived  sequence, 
termed  e*^*^  [extension  (e)  lacking  lysines  (AK)], 
between  Ub  and  the  reporter  part  of  the  fusion  (see 
the  main  text  and  [22,27])  (constructs  IV-VII): 
DHFRha,  a  mouse  DHFR  moiety  extended  at  the 
C-terminus  by  a  sequence  containing  the  hemag¬ 
glutinin-derived  ha  epitope  (constructs  I-VII): 
DHFR‘\  a  mutant  temperature-sensitive  DHFR 
moiety  that  differs  from  DHFR  by  the  Pro  — ►  Leu 
alteration  at  position  66  [19]  (.see  the  main  text). 
Amino  acid  residues  are  indicated  by  their  single¬ 
letter  abbreviations.  Residues  that  vary  between 
constructs  are  boxed  in  the  sequences  above  the 
diagrams. 


Ub  DHFRt®  ha 

MQ . GG0HGSGIMV...L . KKGTYP...FL 


I 


ha  Ub-Arg-DHFRts 


DHFR  ’  ha  Ubn48  DHFR*®  ha 

. ,KKGT . FL  MQ..R.GG0FIGSGIMV...L . KKGTYP...FL 

\  /  Jy 

M-J  DHFR  Ihal-4!lii-M-r  DHFR^®  lhal  dha-Uh-Met-ni^ 


II  ""--I  unt-H  |na[-^i2ip-M-|  DHFR»®  |ha|  dha-Ub-Met-DHFRts 

II I  M--I  DHFR  |ha|-ffl|-R  -[  DHFR^®  [haj  dha-Ub-Arg-DHFR»  1 

Ub  e^K  DHFR'®  ha 

MQ....GG|^HG..R.R...RSGI  MV...L . KKGTYP...FL 

1  76  1  1517  186 

I  II  ^ 

IV  DHFR««  |ha|  Ub-MeW'K.DHFW 

t_  • 

V  fl^R-e^K-l  DHFRt®  |ha|  Ub-Arg-eAK.DHFR‘s  " 

VI  — e^K-  DHFR  |h^  Ub-Met-e^l^DHFR 

- — - -  I 

\/l  I  R  —  ^  DHFR  ha  Ub-Arg-e^^DHFR 
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annealing  to  the  5'  end  of  the  fragment.  The  PCR  product.s  were 
digested  with  Xhol.  blunted  with  Klenow  Pol  I,  followed  by  a 
cut  with  A  fragment  bearing  an  open  reading  frame  was 

inserted  between  the  blunt-ended  Not\  site  and  the  5«(  II  site  of 
the  plasmid  pRc/dhaUbMbgal  [231.  resulting  in  the  plasmids 
pRc/dha-Ub-Met-DHFR"'  and  pRc/dha-Ub-Arg-DFIFR''.  Met- 
and  Arg-DFIFR''  contained  the  si.x-residue  sequence  His-Gly- 
Ser-Gly-Ile-Met  between  the  N-terminal  residue  and  Val.  the 
first  natural  residue  of  DHFR  (Fig.  I). 

To  produce  constructs  IV-VinFig.  I).  a  fragment  encoding 
Arg-DHFR''  was  amplified  by  PCR.  using  construct  I  as 
template.  The  PCR  product  was  digested  with  Xhal.  and  blunted 
with  Klenow  Pol  1.  followed  by  a  5«cll  cut.  The  resulting 
fragment  was  inserted  between  the  blunt-ended  Not\  .site  and  the 
5r((il  site  of  pRc/UbSPgal.  yielding  pRc/Ub-Arg-DHFR''.  The 
latter  served  as  a  vector  for  the  constructs  IV-VIl.  Constructs 
IV  and  V  were  obtained  by  amplifying  fragments  encoding, 
respectively.  Met-e^''-DHFR''  and  Arg-e^'^-DHFR'\  using  the 
plasmid  pLG/Ub''^'’-Val-e-^'^DHFRha  [22]  as  a  template.^The 
N-terminal  residues  Met  and  Arg  were  specified  by  a  PCR 
primer  annealing  to  the  5'  end  of  the  fragment.  Digestion  of  the 
PCR  pioducts  with  5«c'Il  and  PflM]  yielded  a  fragment  coding 


for  Met-  or  Arg-e-^^  and  the  first  39  residues  of  DHFR 
(DHFR|_3ii).  iPJlMl  cleaves  the  DHFR''  open  reading  frame 
upstream  of  the  mutated  codon  at  position  66.)  This  fragment 
was  inserted  in-frame  between  the  Sacll/PJIMl  sites  of  the 
plasmid  pRc/Ub-Arg-DHFR‘'.  yielding  plasmids  expressing  Ub- 
Met-e  ■'-DHFR''  and  Ub-Arg-e-^^-DHFR''.  Constructs  vl  and 
VII  were  produced  by  exchanging  a  SnaBX/Pflnil  fragment  of 
pRc/dhaUbNe^DHFRha  with  the  SiiaBl/PJlM]  fragments 
derived  from  constructs  IV  and  V.  respectively.  Thrs  step 
eliminated  the  fragment  encoding  dha-Ub-Asn-e^-DHFR|_|,  of 
the  original  plasmid,  replacing  it.  in-frame,  with  the  sequence 
encoding  either  Ub-Met-e'^'^-DHFRi.,,  or  Ub-Arg-e'^'^- 
DHFRj.i,.  (The  plasmid  pRc/dhaUbNe'^DHFRha  was"  con¬ 
structed  by  replacing  the  region  encoding  Pgal  in  pRc/ 
dhaUbXpgal  [23]  with  the  region  encoding  DHFR.) 

Yeast  and  mouse  cell  cultures,  transfection,  and  pulse-chase 
analysis 

5.  ccrevhiae  were  grown  and  manipulated  as  described 
previously  [24]  and  in  the  legend  to  Fig.  2.  Mouse  L  cells,  a 
fibroblast-like  cell  line  (ATCC  CCl'i.3.  American  Type 
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.^VDHFR  (Ub-Arg-DHFR  )  were  labeled  for  4  mm  at  23  “C  with  l''Slmethionine/cystei„e.  followed  by  a  chase  for  10  and  30  min  as  de.scribed  [40]  .hen 
e.x,rac.,on.  .minunoprecpuanon.  and  Tncine-based  SDS/PAGE  (12%  gel)  of  Arg-DHFR"'  |41].  Lanes  d-f.  same  as  lanes  a-c.  except  that  immediately  after  the 
4-mm  pulse  cells  were  shifted  to  37  C.  and  the  chase  was  carried  out  at  37  “C.  Lanes  g-i.  same  as  lanes  d-f.  but  an  independent  pulse-chase  experimL  Lanes 

Plasmid-bome  t^flC2  gene  [42].  Lanes  p-r.  same  as  lanes  m-o.  but  with  KMY613  Mr2J )  lacking  the  plasmid-bome  t/flC2.  Lanes  s-u.  JD47.13c  (UBBI)  cells 

-re  labeled  and  proces.sed  as  descrit^d  for  lanes  d-f.  Lanes  v-x.  same  as  lan^tfu  bm 
f  RRM  .1^  M  0^20  PM  30  min  before  the  pulse.  (B)  Lane  M.  molecular  mass  markers.  Lanes  a-c.  JD47.13c 

hifted  o  37'»C  7T'"^d  h"'  h  [''Slmethionine/cysteine.  Immediately  thereafter,  the  cells  were 

■  but  with  Ar^^  DhS'  I  ^  immunoprecipitalion.  and  SDS/PAGE  of  Met-DHFR''.  Unes  d-f.  same  as  lanes  a- 

Lne  r  ■  K  ■  IT  30  min  before  the  pulse  to  a  final  concentration  of  20  pM. 

Lnes  m“  ;  ITw  h  Ar»  DHF^“  T  ^  3’  “C-  Lanes  p-r.  same  as 

‘n  r  n!r  ^TX  added  to  the  growth  medium  30  min  before  the  pulse  to  a  final  concentration  of 

_()  p.M.  Lanes  v-x,  same  as  lanes  s-u,  but  with  Arg-DHFR  . 
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•  Culture?  Collection.  Rockville.  MD.  USA),  were  grown  as 
monolayers  in  Dulbecco's  modified  Eagle's  medium/F12 
medium,  supplemented  with  \07c  fetal  bovine  serum,  antibiotics, 
2  niM  L-glutamine  and  20  mM  Hepes  (.sodium  salt;  pH  73).  The 
cultures  were  regularly  checked  for  the  ab.sence  of  mycoplasmas. 
Ttansient  transfections  and  pulse— chase  analyses  were  per¬ 
formed  as  de.scribed  [2.^].  For  the  pulse-chases  at  23  ‘’C.  cells 
were  transferred  to  this  temperature  30  min  before  labeling  for 
10  min  with  0.1  mCi  (1  Ci  =  37  GBq)  of  r’‘‘S]EXPRESS  (New 
England  Nuclear.  Boston.  MA.  USA).  Cells  were  chased  at 
either  23  C  or  37  "C  in  the  complete  medium  containing 
0.1  mg  mL"'  cycloheximide.  All  media  were  pre-equilibrated  at 
the  desired  temperature.  In  experiments  that  involved  MTX,  the 
drug  was  added  30  min  before  labeling  and  was  pre.sent  at  a 
final  concentration  of  20  p,M  (diluted  from  20  niM  MTX  in 
0.9  M  potassium/.sodium  phosphate,  pH  7.3).  The  samples  were 
analyzed  by  SDS/PAGE  (I59c  gel),  followed  by  autoradio¬ 
graphy  and  quantitation  u.sing  a  Pho.sphorlmager  (Molecular 
Dynamics.  Sunnyvale.  CA.  USA). 

To  distinguish  between  partial  half-lives  that  approximate  the 
slopes  of  different  regions  of  a  non-exponential  decay  curve,  a 
generalized  half-life  term,  /d:,'.  was  used,  where  0.5  denotes  the 
parameter’s  half-life  asj^ct  and  y-c  denotes  the  relevant  time 
interval,  from  y  to  :  min  of  chase.  Another  term  (23J.  initial 
decay  (ID),  expressed  as  a  percentage,  equals  I  minus  the  ratio 
of  the  amount  of  a  radiolabeled  X-DHFR  test  protein  (normal¬ 
ized  against  the  reference  protein,  Met-DHFR-Ub;  Fig.  1 )  at  the 
end  of  the  pulse  (time  0)  to  the  (normalized)  amount  of  a 
radiolabeled  long-lived  X-DHFR  such  as  Met-DHFR.  As  ID 
may  depend  on  the  duration  of  a  pulse,  a  superscript,  in  ID', 
invokes  the  pulse  time  explicitly  [23], 

RESULTS 

Inhibition  of  Arg-DHPR'*  degradation  by  MTX  in 
S,  cerevisiae 

The  major  test  protein  of  this  work  was  Arg-DHFR"  (produced 
from  Ub-Arg-DHFR")  (Fig.  1,  construct  I),  identical  with  the 
Arg-DHFR'''  moiety  of  a  larger  fusion  described  in  the  intro¬ 
duction.  The  test  proteins  bore  C-terminal  ha.  a  hemagglutinin- 
derived  epitope  tag  recognized  by  a  monoclonal  antibody  [22] 
(Fig.  1;  the  ha  epitope  is  omitted  below  in  the  names  of  specific 
test  proteins).  For  the  analysis  in  5.  cerevisiae.  Arg-DHFR'"  was 
expressed  from  the  copper-inducible  Pcup/  promoter  in  a  low- 
copy  vector,  and  the  metabolic  fate  of  Arg-DHFR'"  was  deter¬ 
mined  in  a  pulse-chase  assay  (see  Materials  and  methods). 
Similarly  to  the  previously  examined  Arg-DHFR'"-Ura3p  and 
Arg-DHFR'"-Cdc28p  [19],  Arg-DHFR'"  was  a  long-lived  protein 
at  23  °C  (/()  5  >  4  h)  but  a  short-lived  protein  at  37  °C  (to  5  = 

4  min)  (Fig.  2A,  lanes  a-c  vs.  d-f).  The  degradation  of  Arg- 
DHFR  at  37  C  was  carried  out  by  the  N-end  rule  pathway,  as 
indicated  by  (a)  the  requirement  for  Ubrlp  (E3),  the  recognition 
component  of  this  pathway  (Fig.  2A,  lanes  g-i  vs.  j-1),  (b)  the 
requirement  for  Ubc2p  (E2),  the  relevant  Ub-conjugating  enzyme 
(Fig.  2A,  lanes  m-o  vs.  p-r),  and  (c)  the  long  half-life  of  Met- 
DHFR'",  an  otherwise  identical  protein  that  bore  N-terminal 
Met.  a  stabilizing  residue,  instead  of  Arg  (Fig.  2B,  lanes  a-c  vs 
d-f). 

Previous  work  [27]  has  shown  that  the  degradation  of  Arg-e*^- 
DHFR  by  the  N-end  rule  pathway  in  an  extract  from  rabbit 
reticulocytes  can  be  inhibited  by  the  folate  analog  MTX,  a  high- 
affinity  DHFR  ligand  (Ka  «  10  pM).  It  was  far  from  clear 
whether  MTX  would  have  the  same  effect  in  vivo,  in  part 
because  the  purified  N-end  rule  substrates  added  to  the  extract 
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contained  the  folded  DHFR  moieties  |27|.  By  contra.si.  a  na.scent 
DHFR-based  substrate  is  unable  to  bind  MTX  until  after  the 
folding  of  DHFR  (28j.  but  can  be  targeted  by  the  N-end  rule 
pathway  in  vivo  at  any  time,  possibly  even  durinst  translation 
116]. 

The  metabolic  fate  of  Arg-DHFR'"  at  37  'C  was  monitored  in 
5.  cerevisiae  in  the  absence  or  pre.sence  of  20  pM  MTX  in  the 
medium.  MTX  was  found  to  be  an  efficacious  inhibitor  of  .Arg- 
DHFR"  degradation  in  5.  cerevisiae  (Fig.  2B.  lanes  d-f  and  p-r 
vs.  j-1  and  v-x).  MTX  inhibited  the  degradation  of  Arg-DHFR'" 
legaidless  of  whether  it  was  added  to  the  growth  medium 
15  min  or  24  h  before  the  pulse-chase  assay  (data  not  shown; 
the  growth  of  cells  was  not  affected  significantly  by  20  pM 
MTX  for  at  least  24  h).  The  protective  effect  of  MTX  on  Arg- 
DHFR  was  essentially  the  same  regardless  of  whether  the  pulse 
labeling  (followed  by  the  37  °C  chase)  was  cairied  out  at  23  °C 
(Fig.  2B.  lanes  d-f  vs.  j-1)  or  at  37  °C  (Fig.  2B.  lanes  p-r  vs. 
v-x). 

The  degradation  (at  37  °C)  of  Arg-DHFR''-Cdc28p.  which 
differed  from  Arg-DHFR'"  (produced  from  Ub-Arg-DHFR'")  by 
the  piesence  of  the  Cdc28p  kinase  moiety,  was  al.so  inhibited  by 
MTX  (Fig.  2A,  lanes  s-u  vs.  v-x).  Arg-DHFR'"-Cdc28p.  which 
was  long-lived  at  23  °C.  or  at  37  T  in  iihrJA  cells,  could 
function  as  a  Cdc28p  kina.se  in  vivo  [19].  As  Arg-DHFR'"- 
Cdc28p  was  short-lived  at  37  °C  in  UBRI  S.\erevi.siae 
(Fig.  2A.  lanes  s-u).  the  cells  that  akso  cairied  a  mutant  (/.v) 
cdclt)  allele  ( 19|  were  not  viable  at  37  °C.  owing  to  the  absence 
of  the  e.ssential  Cdc28p  kinase  at  this  temperature.  The  addition 
of  MTX  (at  20  pM)  was  found  to  re.scue  these  cells  at  37  °C 
through  the  inhibition  of  degradation  of  Arg-DHFR''-Cdc28p 
(Fig.  2 A.  lanes  .s-u  vs.  v-x.  and  data  not  .shown).  The  effect  of 
MTX  was  specific  for  DHFR-containing  substrates,  as  degrada¬ 
tion  of  unrelated  test  proteins,  for  example  Arg-Pgal  (Ub-Arg- 
Pgal),  by  the  N-end  rule  pathway  was  unimpaired  in  the 
presence  of  MTX  (data  not  shown). 

DHFR  as  an  N-end  rule  substrate  in  mammalian  cells 

DHFR-based  substrates  of  the  N-end  rule  pathway  were  used  in 
the  e^lier  studies  with  S.  cerevisiae  [16,19],  but  have  not  been 
examined  in  mammalian  cells.  We  used  L  cells,  a  mouse 
fibroblast-like  cell  line,  and  the  ubiquitin/protein/reference 
(UPR)  technique,  which  increases  the  accuracy  of  pulse-chase 
assays  [26]  by  providing  a  reference  protein  [23],  In  the  UPR 
technique,  Ub  is  located  between  a  protein  of  interest  and  a 
reference  protein  in  a  linear  fusion  (e.g.  construct  II  in  Fig.  1). 
This  fusion  is  cleaved,  cotranslationally  or  nearly  so.  by  UBPs  at 
the  last  residue  of  Ub,  producing  equimolar  amounts  of  the 
protein  of  interest  and  the  reference  protein  bearing  a  C-terminal 
Ub  moiety.  If  both  the  reference  protein  and  the  protein  of 
interest  are  immunoprecipitated  in  a  pulse-chase  assay,  the 
relative  amounts  of  the  protein  of  interest  can  be  normalized 
against  the  reference  in  the  same  sample.  The  UPR  technique 
can  thus  compensate  for  the  scatter  of  immunoprecipitation 
yields,  sample  volumes  and  other  sources  of  sample-to-sample 
variation  [23]. 

The  UPR  constructs  of  the  present  work  were  fusions 
containing  the  metabolically  stable  Met-DHFR-Ub  moiety  as  a 
reference  protein  (termed  dha-Ub)  and  a  DHFR-based  N-end 
ntle  substrate  such  as,  for  example,  Arg-DHFR'"  as  a  protein  of 
interest  (Fig.  1,  construct  III).  To  preclude  the  possibility  that 
ihs  C-terminal  Ub  moiety  of  dha-Ub  could  function  as  a 
ubiquitylation/degradation  signal,  the  Lys48  residue  of  Ub  (a 
major  site  of  isopeptide  bonds  in  multi-Ub  chains  [17,22.29]) 
was  converted  into  Arg,  which  cannot  be  ubiquitylated  [12]. 
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Fig.  3.  Inhibition  of  Arg-DHFR'"  degradation 
in  mouse  cells  by  MTX.  (A)  Lanes  a-d,  mouse  L 
cells  transiently  expressing  a  U PR-based  version 
of  Met-DHFR'^  (Met-DHFR-ha-Ub'^^^Met- 
DHFR‘'-ha:  see  Fig.  1.  construct  II)  were  labeled 
at  23  °C  for  10  min  with  ( ^'Slmethionine/ 
cysteine,  followed  by  a  chase,  also  at  23  T. 
tor  30.  60  and  120  min  in  the  presence  of 
cycloheximide,  extraction,  immunoprecipitation. 
and  SDS/PAGE  of  Met-DHFR‘^ha  and 
Met-DHFR-ha-Ub*^'*^.  Lanes  e-h,  same  as  lanes 
a-d  but  with  Arg-DHFR"  (Mei-DHFR-ha- 
Ub^^''^-Arg-DHFR''-ha).  (B)  Lanes  a-d.  same  as 
lanes  a-d  in  (A),  except  that  both  the  pulse  and 
the  chase  were  at  37  °C.  Lanes  e-h,  same  as  lanes 
a-d.  but  with  Arg-DHFR'^ha  (Met-DHFR-ha- 
Ub^^^-Arg-DHFR’'-ha).  (C)  Lanes  a-d,  .same  as 
lanes  a-d  in  (B)  but  with  20  )xm  MTX  in  the 
medium.  Lanes  e-h,  same  as  lanes  a-d,  but  with 
Arg-DHFR’'  (Met-DHFR-ha-Ub'^'^''-Arg-DHFR'^ 
ha).  The  bands  of  Arg/Met-DHFR‘^-ha  and 
Mei-DHFR-ha-Ub'^^^  (denoted  as  dha-Ub)  are 
indicated.  A  larger  area  of  the  gel  is  shown  for 
lanes  a-h  in  (B),  to  illustrate  the  overall 
immunoprecipitation  pattern. 


The.se  t'u.sions  were  expressed  from  the  cytomegalovirus  early 
promoter.  P(  a/i  .  in  mouse  L  cells.  The  cells  were  labeled  for 
10  min  at  either  23  °C  or  37  °C  with  ['‘'S)methionine/cysteine. 
followed  by  a  chase  of  30,  60  and  120  min  at  the  same  tem¬ 
perature  in  the  presence  of  cycloheximide.  immunoprecipitation 
with  anti-ha  monoclonal  antibody,  and  analysis/quantitation  of 
immunoprecipitated  proteins  by  SDS/PAGE  and  Phosphor- 
Imager.  using  UPR  (see  Materials  and  methods).  Pulse-chase 
assays  in  which  cycloheximide  (a  translation  inhibitor)  was 
omitted  yielded  similar  results  (data  not  shown). 

Both  Met-DHFR'  and  Arg-DHFR''  were  long-lived  in  mouse 
cells  at  23  “C  (/0.5  >  10  h)  (Fig.  3A,  lanes  a-h,  and  Fig.  4A). 
By  contrast,  Arg-DHFR"  was  short-lived  at  37  “C  in  mouse 
cells;  its  half-life,  determined  between  0  and  60  min  of  chase 
and  denoted  as  [23],  was  «  10  min  (Fig.  3B.  lanes  e-h, 

and  Fig.  4A).  In  addition,  a  large  fraction  (»  30%)  of  the  pulse- 
labeled  Arg-DHFR'"  (but  not  of  Met-DHFR")  was  degraded 
during  the  10-min  pulse  at  37  °C.  as  could  be  seen  from 
comparing  the  UPR-normalized  amounts  of  Arg-DHFR"  at  time 
0  (the  beginning  of  the  chase)  at  23  °C  and  37  °C  (Fig.  4A).  In 
contrast  with  a  conventional  pulse-chase  assay,  the  use  of  UPR 
in  a  setting  where  a  protein  can  be  made  either  short-lived  or 
long-lived  allows  the  detection  and  measurement  of  proteolysis 
not  only  during  the  chase  but  during  the  pulse  as  well  [23],  The 
extent  of  degradation  of  a  protein  during  the  pulse  is  denoted  as 
ID'  [23],  In  the  present  context,  the  variable  ID'  is  defined  as  the 
extent  of  degradation  of  a  radiolabeled  short-lived  protein  (Arg- 
DHFR  at  37  °C)  at  the  end  of  a  pulse  relative  to  the  amount  of 
a  nearly  identical  protein  (Met-DHFR"  at  37  T)  that  is  long- 
lived  under  the  same  conditions. 

A  large  fraction,  but  not  all,  of  the  Arg-DHFR'"  degradation  in 
mouse  cells  was  carried  out  by  the  N-end  rule  pathway.  This 
could  be  seen  by  comparing  the  UPR-based  decay  curves  of 
Met-DHFR'"  and  Arg-DHFR'"  at  37  °C  (Fig.  3B  lanes  a-d  vs. 
lanes  e-h  and  Fig.  4A).  Specifically,  not  only  Arg-DHFR'"  but 
also  Met-DHFR'"  were  metabolically  unstable  at  37  °C  in  mouse 
cells,  the  corresponding  being  «  10  min  and  =  48  min. 
respectively  (Fig.  4A).  As  Met  is  a  stabilizing  residue  in  the 


N-end  rule  |23.30|.  we  infer  that  Arg-DHFR'"  was  targeted  not 
only  by  the  N-end  rule  pathway,  but  al.so  by  another  proteolytic 
system,  which  recognized  a  structural  feature  that  resulted  from 
a  conformational  perturbation  of  the  DHFR'"  moiety  at  37  °C. 
This  aspect  of  Arg-DHFR'"  acted  as  a  degron  in  mou.se  cells  but 
not  in  yeast,  inasmuch  as  the  same  protein  was  targeted 
exclusively  by  the  N-end  rule  pathway  in  S.  cen-visictc  at  37  °C 
(Fig.  2A  lanes  g-i  vs.  lanes  j-1).  The  nature  of  this  degradation 
signal  in  the  conformationally  perturbed  (at  37  °C)  DHFR'" 
moiety  is  not  known. 

Inhibition  of  both  degradation  signals  of  Arg-DHFR'"-ha  by 
MTX  in  mouse  cells 

Mouse  L  cells  expressing  Met-DHFR'"  (dha-Ub-Met-DHFR'")  or 
Arg-DHFR'"  (dha-Ub-Arg-DHFR'")  were  incubated  at  37  °C  for 
30  min  with  20  |xM  MTX  in  the  medium,  followed  by  a  10-min 
pulse  with  [’"Slmethionine/cysteine.  a  chase,  and  SDS/PAGE 
analysis.  The  degradation  of  Met-DHFR'"  at  37  °C.  which  is 
mediated  by  a  non-N-degron  in  the  conformationally  perturbed 
DHFR"  moiety  (see  above),  was  virtually  completely  inhibited 
by  MTX  (Fig.  3B  lanes  a-d  vs.  Fig.  3C  lanes  a-d  and  Fig.  4B). 
Specifically,  the  of  Met-DHFR'"  at  37  "C  was  «  48  min  in 
the  absence  of  MTX  and  >  10  h  in  the  presence  of  MTX 
(Fig.  4B).  The  much  more  rapid  degradation  of  Arg-DHFR'" 
carried  out  by  both  the  N-end  rule  pathway  and  the  other 
proteolytic  pathway  was  strongly  but  incompletely  inhibited  by 
MTX  (Fig.  3B  lanes  e-h  vs.  Fig.  3C  lanes  e-h  and  Fig.  4B). 
Specifically,  in  the  absence  of  MTX,  the  of  Arg-DHFR'" 
was  =  10  min  (ID"’  of  36%)  (Fig.  4B).  In  the  presence  of 
MTX,  these  variables  changed  to  /,);"’  =  37  min  and  ID"’  of 
29%  (Fig.  4B). 

A  comparison  of  the  corresponding  decay  curves  (Fig.  4), 
made  more  reliable  by  the  increased  accuracy  of  a  UPR-based 
pulse-chase  assay,  suggested  one  reason  for  the  leaky  inhibition 
of  the  N-degron  of  Arg-DHFR"  by  MTX.  Specifically,  the  ID"’ 
of  Met-DHFR'"  (i.e.  the  extent  of  Met-DHFR'"  degradation 
during  the  pulse)  was  <  10%  in  the  absence  of  MTX.  whereas 
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and  the  d,-iecognizing  pathway  (Fig.  5B.  lanes  d-f).  A  minor 
proteolytic  fragment  of  Arg-e-^^-DHFR''.  visible  in  Fig.  .5B-E 
specific  to  N-end  rule  substrates  bearing  an 
c  -type  extension:  an  analogous  fragment  was  also  obseiwed 
with  X-e^-Pgal-based  N-end  rule  substrates  1 16]. 

We  also  examined  the  metabolic  fate  of  Met-e’^’^-DHFR  (Ub- 
Met-e-^''-DHFR)  and  Arg-e'^'^-DHFR  (Ub-Arg-e-^^-DHFR). 
which  were  identical  with  the  substrates  above  except  that  they 
bore  the  wild-type  mouse  DHFR  moiety  (Fig.  1.  constructs  VI 
and^  VIl).  Met-e'^''-DHFR  was  long-lived  at  both  2.^  T  and 
51  C  in  mou.se  cells  (Fig.  5D.  lanes  a-c.  and  data  not  shown), 
indicating  that  the  heat-activated  d^-type  degron  of  substrates 
containing  the  DHFR''  moiety  (e.g.  Fig.  4A)  resulted  from  the 
Pro66  gr:  Leu66  mutation  that  yielded  this  moiety.  The 
degradation  of  Arg-e’^'^^-DHFR  was  mediated  exclusively  by 
the  N-end  rule  pathway  (Fig.  5D.  lanes  d-f  vs.  lanes  a-c), 
unlike  the  double-pathway  degradation  of  Arg-e‘^''-DHFR''  at 
37  C  (Fig.  5C.  lanes  d— f).  In  contrast  with  the  results  with  Arsj- 
DHFR"  and  Met-DHFR''.  the  presence  of  MTX  did  not  result  In 
a  significant  stabilization  of  Arg-e'^'^-DHFR  (Fig.  5D  lanes  d-f 
vs.  Figure  5E  lanes  d-f). 

DISCUSSION 

We  report  the  following  results. 

(a)  Arg-DHFR''  is  long-lived  at  23  °C  but  short-lived  at  37  °C 
in  both  S.  cerevi.siuc  and  mouse  cells. 

(b)  In  yeast.  Arg-DHFR"  is  degraded  (at  37  °C)  exclusively 
by  the  N-end  rule  pathway,  whereas  in  mouse  cells  the  same 
protein  at  the  same  temperature  is  degraded  by  another 
proteolytic  pathway  as  well.  The  corresponding  mouse-.specific 
degron  is  active  at  37  “C  but  inactive  at  23  °C.  This  degradation 
signal  is  a  currently  unknown  feature  of  the  perturbed  con¬ 
formation  of  the  DHFR''  moiety  at  37  °C.  which  differs  from  the 
wild-type  DHFR  moiety  by  a  Pro  —  Leu  alteration  at  position 
66. 

(c)  MTX,  a  low-A/r  ligand  of  DHFR  (/fj  =  10  pM),  inhibits 
the  degradation  of  Arg-DHFR"  (at  37  T)  nearly  completely  in 
yeast  and  partially  in  mouse  cells. 

(d)  The  ability  of  MTX  to  inhibit  the  degradation  of  Arg- 
DHFR''  in  vivo  is  retained  when  Arg-DHFR"  is  utilized  as  a 
portable  degron  that  confers  metabolic  instability  on  linked 
unrelated  proteins  in  yeast. 

(e)  Using  the  UPR  technique  [23]  to  determine  the  metabolic 
fate  of  Arg-DHFR"  in  mouse  cells,  we  found  that  »  36%  of  the 
labeled  Arg-DHFR''  molecules  are  degraded  during  the  10-min 
pulse,  and  that  the  presence  of  MTX  decreases  this  fraction  only 
slightly,  to  «=  30%.  This  finding  accounts  for  the  leaky  inhi¬ 
bition  of  Arg-DHFR''  degradation  by  MTX  in  mouse  cells. 
Specifically,  this  finding  suggests  that  a  large  fraction  of  the 
nascent  Arg-DHFR''  is  targeted,  in  mouse  cells,  by  the  N-end 
rule  pathway  before  the  conformation  of  the  DHFR''  moiety  is 
mature  enough  [28]  to  allow  high-affinity  binding  of  MTX.  That 
the  MTX-mediated  inhibition  of  degradation  of  Arg-DHFR''  in 
yeast  is  much  less  leaky  than  in  mouse  cells  is  but  one  difference 
in  the  detailed  properties  of  the  yeast  and  mammalian  N-end 
rule  pathways  [see  also  item  (0]. 

(f )  When  the  N-terminus  of  Arg-DHFR''  was  extended  with  a 
42-residue  lysine-lacking  extension,  termed  e'^'^.  the  N-degron 
of  the  resulting  Arg-e'^'^-DHFR'*  was  active  at  both  23  °C  and 
37  °C.  unlike  the  mouse-specific  degron,  which  remained 
inactive  at  23  °C.  Moreover,  the  degradation  of  Arg-e"^*^- 
DHFR''.  in  contrast  with  that  of  Arg-DHFR*',  could  not  be 
inhibited  by  MTX,  suggesting  that  the  inactivity  of  the  N-degron 
in  Arg-DHFR''  at  23  "C  results  at  least  in  part  from  inaccessibility 


of  its  N-terminal  arginine  to  the  targeting  complex  of  the  N-end 
rule  pathway.  Arg-e‘^''-DHFR.  which  bore  the  wild-type  DHFR 
moiety,  was  also  short-lived  in  mouse  cells.  Previously,  the 
same  protein  was  lound  to  be  long-lived  in  yeast,  unle.ss  the  e'^*' 
extension  was  replaced  by  an  otherwise  identical  extension, 
termed  e'',  which  contained  Lys  residues  that  functioned  as  the 
second  determinant  of  the  N-degron  1 16). 

Potential  applications  of  a  heat-inducible.  MTX-suppressible 
portable  N-degron  include  its  use  to  produce  conditional 
mutants  in  homeothermic  animals  .such  as  mou.se.  One  difficulty 
in  the  current  approaches  to  this  problem  [3 1  -33 )  is  that  either  a 
deletion  or  tran.scriptional  repression  of  a  gene  of  interest  leaves 
the  previously  produced  gene  product  unperturbed.  If  this 
protein  is  long-lived  and  resides  in  a  non-dividing  cell,  there 
may  be  a  considerable  phenotypic  lag  between  the  conditionally 
introduced  genetic  change  and  the  actual  inactivation  or  dis¬ 
appearance  of  a  corresponding  protein.  A  degron  that  can  be 
regulated  by  a  cell-penetrating  ligand  can  be  employed  to 
address  this  problem.  Since  chronic  admini.stration  of  MTX  is 
toxic  to  manimal.s.  it  will  be  necessarv'  to  construct  an  analogous 
DHFR-based  N-degron  that  uses  £.  coli  DHFR  and  is  inhibited 
by  trimethoprim,  which  binds  tightly  to  the  E.  coli  but  not  the 
mou.se  DHFR  [.34j.  If  a  portable  N-degron  of  this  kind  could  be 
constructed,  combining  it  w  ith  a  conditional  repression  of  a  gene 
of  interest  should  yield  better  methods  for  prixlucing  conditional 
mutants  in  homeothermic  animals. 

MTX,  through  its  binding  to  the  substrate  pocket  of  DHFR, 
stabilizes  DHFR  conformation  [35,36].  This  effect  was 
employed  in  several  studies  with  DHFR  as  a  reporter  protein. 
Foi  example,  the  binding  of  MTX  to  DHFR  can  preclude,  under 
certain  conditions,  the  translocation  of  a  DHFR-containing 
fusion  protein  across  biological  membranes  [37,38).  MTX  can 
partially  protect  DHFR  against  in  vitro  proteolysis  by  thermo- 
lysin  (38,39).  MTX  has  also  been  shown  to  block  the  degra¬ 
dation  of  a  targeted  multiubiquitylated  Arg-e*^-DHFR  in 
reticulocyte  extract  [27).  In  the  latter  study,  the  N-end  rule 
pathway  in  reticulocyte  extract  was  presented  with  a  prefolded 
wild-type  DHFR  moiety.  By  contrast,  in  the  present  work  with 
intact  cells,  the  same  pathway  could  target  a  DHFR-based  N-end 
rule  substrate  immediately  after  (and  possibly  even  during)  its 
synthesis.  Further  analysis  is  required  to  clarify  the  apparent 
discrepancy  between  the  previously  observed  inhibition  of 
degradation  of  Arg-e^-DHFR  by  MTX  in  vitro  (in  reticulocyte 
extract)  [27]  and  the  absence  of  a  comparable  effect  of  MTX  in 
vivo  on  this  class  of  substrates,  in  which  N-terminal  Arg  is 
constitutively  exposed.  One  difference  between  the  two  settings 
is  the  presentation  of  prefolded  DHFR  moieties  in  reticulocyte 
extract  compared  with  a  kinetic  competition  between  the  folding 
of  newly  formed  DHFR  moieties  and  their  targeting  for 
degradation  by  the  N-end  rule  pathway. 

The  use  of  the  UPR  technique  not  only  increased  the  overall 
accuracy  of  pulse-chase  assays  with  mouse  cells  but  also 
showed  that  =«  30%  of  the  newly  made  Arg-DHFR*^  was 
degraded  during  the  pulse.  The  finding  that  the  fraction  of  Arg- 
DHFR^  degraded  during  the  pulse  decreased  only  slightly  in  the 
presence  of  MTX  strongly  suggests  that  most  of  the  labeled  Arg- 
DHFR’"  is  conformationally  immature  shortly  after  synthesis, 
and  therefore  cannot  form  a  high-affinity  complex  with  MTX. 
The  Pro  Leu  alteration  at  position  66  that  yielded  DHFR’" 
occurred  in  the  region  of  DHFR  that  interacts  with  the  aromatic 
ring  of  MTX  [35],  and  is  likely  to  have  resulted  in  a  decreased 
affinity  of  DHFR  for  MTX.  Nonetheless,  the  test  proteins 
containing  the  DHFR‘"  moiety  were  efficiently  retained  on  an 
MTX  affinity  column  (data  not  shown),  and  their  in  vivo 
degradation  could  be  specifically  inhibited  by  MTX. 
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A  model  of  targeting  by  the  N-end  rule  pathway  that  accounts 
for  our  findings  with  the  heat-inducible  N-degron  assumes  that 
the  main  cause  of  metabolic  stability  of  Arg-DHFR*"  at  23  T  is 
not  the  absence  of  a  sterically  accessible  Lys  residue  within 
the  folded  DHFR  globule,  but  rather  a  steric  inaccessibility  of 
the  N-terminal  Arg  residue.  In  this  model,  the  activation  of  the 
previously  cryptic  N-degron  of  Arg-DHFR*"  at  37  °C  is  caused 
by  an  increased  exposure  and  flexibility  of  a  region  bearing  the 
N-terminal  Arg.  a  transition  that  leads  to  the  binding  of  Arg  by 
the  targeting  complex  of  the  N-end  rule  pathway.  According  to 
this  interpretation,  MTX  stabilizes  Arg-DHFR*"  against  degra¬ 
dation  at  37  °C  not  by  precluding  the  conformational  mobili¬ 
zation  of  internal  Lys  residues,  but  by  precluding  a  temperature- 
mediated  increase  in  the  exposure  of  the  N-terminal  Arg.  In 
addition  to  being  consistent  with  the  available  evidence,  this 
model  accounts  for  the  otherwise  puzzling  result  that  Arg-c^*^- 
DHFR‘‘'-ha  (which  differs  from  Arg-DHFR’"’  by  the  presence  of 
e"^*^)  is  short-lived  at  both  23  and  37  °C,  whereas  Arg- 
DHFR’"  is  short-lived  only  at  37  T.  Indeed,  the  e*^^  extension 
would  result  in  a  temperature-independent  enhanced  exposure  of 
N-terminal  Arg,  a  change  that  would  be,  in  this  model,  sufficient 
for  the  temperature-independent  activity  of  the  N-degron  in  Arg- 
e‘^’^-DHFR‘\  The  earlier  model  proposed  by  Dohmen  ct  oL  [19], 
which  presumed  that  the  inactivity  of  the  N-degron  in  Arg- 
DHFR'"  at  23  °C  was  caused  by  the  absence  of  sterically 
accessible  Lys  residues,  cannot  account  for  the  above  result. 

How  can  we  verify  the  Arg-exposure  model?  One  way  is 
suggested  by  the  existence,  in  both  yeast  and  mammals,  of  the 
enzymes  N-terminal  amidase  (Nt-amidase)  and  Arg-tRNA- 
protein  transferase  (R-transferase),  which  chemically  modify 
specific  (sterically  exposed)  N-terminal  residues  of  proteins  in 
the  cytosol.  In  5.  cercvislae.  the  yV7A /-encoded  Nt-amidase 
deamidates  N-terminal  Asn  or  Gin:  the  ATEZ-encoded  R- 
transferase  conjugates  Arg  to  N-terminal  Asp  or  Glu  (reviewed 
in  1 10]).  A  test  of  the  model  would  involve  the  expression  of,  for 
example,  Asn-DHFR'"  (identical  to  Arg-DHFR*^  except  for  the 
presence  of  N-terminal  Asn)  in  yeast  cells  that  lack  UbrIp,  a 
component  of  the  N-end  rule  pathway  that  recognizes  primary 
destabilizing  N-terminal  residues  such  as  Arg.  (The  absence  of 
Ubrlp  would  preclude  the  degradation  of  test  proteins.)  The 
model  predicts  that  at  23  °C  the  N-terminal  Asn  of  Asn-DHFR’" 
would  be  deamidated  inefficiently  or  not  at  all,  in  contrast  with 
what  happens  at  37  °C.  This  can  be  verified  by  isolating  Asn- 
DHFR*''  from  cells  incubated  at  either  23  or  37  °C,  and 
determining  its  N-terminal  residue.  If  the  model  is  correct,  the 
N-terminal  sequence  must  begin  largely  or  entirely  with  Asn  at 
23  ®C,  but  would  become  Arg-Asp-...  at  37  °C.  (In  the  latter 
case,  the  N-terminal  Asp,  produced  from  Asn,  would  be 
arginylated  by  R-transferase.)  Experimental  verification  of  this 
model  is  under  way. 
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ABSTRACT  The  insufficient  selectivity  of  drugs  is  a  bane 

of  present-day  therapies.  This  problem  is  significant  for 
antibacterial  drugs,  difficult  for  antivirals,  and  utterly  un¬ 
solved  for  anticancer  drugs,  which  remain  ineffective  against 
major  cancers,  and  in  addition  cause  severe  side  effects.  The 
problem  may  be  solved  if  a  therapeutic  agent  could  have  a 
multitarget,  combinatorial  selectivity,  killing,  or  otherwise 
modifying,  a  cell  if  and  only  if  it  contains  a  predetermined  set 
of  molecular  targets  and  lacks  another  predetermined  set  of 
targets.  An  earlier  design  of  multitarget  drugs  [Varshavsky, 
A.  (1995)  Proc.  Natl  Acad.  ScL  USA  92,  3663-3667]  was 
confined  to  macromolecular  reagents  such  as  proteins,  with 
the  attendant  difficulties  of  intracellular  delivery  and  immu- 
nogenicity.  I  now  propose  a  solution  to  the  problem  of  drug 
selectivity  that  is  applicable  to  small  (^1  kDa)  drugs.  Two 
Ideas,  codominant  interference  and  antieffectors,  should  allow 
a  therapeutic  regimen  to  possess  combinatorial  selectivity,  in 
which  the  number  of  positively  and  negatively  sensed  macro- 
molecular  targets  can  be  two,  three,  or  more.  The  nature  of  the 
effector  and  interference  moieties  in  a  multitarget  drug 
determines  its  use;  selective  killing  of  cancer  cells  or,  for 
example,  the  inhibition  of  a  neurotransmitter-inactivating 
enzyme  in  a  specific  subset  of  the  enzyme-containing  cells.  The 
in  vivo  effects  of  such  drugs  would  be  analogous  to  the 
outcomes  of  the  Boolean  operations  “and,”  “or,”  and  combi¬ 
nations  thereof.  I  discuss  the  logic  and  applications  of  the 
antieffector  and  interference/codominance  concepts,  and  the 
attendant  problem  of  pharmacokinetics. 


The  many  successes  of  pharmacology  (1, 2)  do  not  include  the 
problem  of  cancer.  Major  human  cancers  are  incurable  once 
they  have  metastasized.  A  few  relatively  rare  cancers,  such  as 
testicular  carcinoma  in  men,  Wilms*  kidney  tumor,  and  some 
leukemias  in  children,  can  often  be  cured  through  chemother¬ 
apy  but  require  cytotoxic  treatments  of  a  kind  that  cause  severe 
side  effects  and  are  themselves  carcinogenic  (2). 

The  main  reason  for  the  failure  of  cytotoxic  therapies  is  their 
insufficient  selectivity  for  tumors.  For  example,  treatments 
with  radiation  or  alkylating  agents  perturb  many  functions  that 
are  common  to  all  cells.  The  more  selective  cytotoxic  drugs,  for 
instance,  methotrexate,  taxol,  and  etoposide,  perturb  the 
functions  of  speciftc  macromolecular  targets  (dihydrofolate 
reductase,  microtubules,  and  topoisomerase  II),  but  these 
targets  are  present  in  both  normal  and  malignant  cells  (1,  2). 
Hence  the  low  therapeutic  index  of  anticancer  drugs  and  their 
systemic  toxicity  at  clinically  relevant  doses.  Because  the 
mitotic  activity  of  cells  in  a  tumor  is  often  lower  than  the 
mitotic  activity  of  normal  cells  in  self-renewing  tissues  such  as 
the  bone  marrow  (3),  one  might  not  have  expected  these  drugs 
to  work  at  all — ^to  have  any  preference  for  the  killing  of  cancer 
cells.  That  such  preference  actually  exists  stems  in  part  from 
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the  fact  that  tumor  cells  are  often  perturbed  by  their  mutations 
into  stress-hypersensitive  states.  Consequently,  these  tumor 
cells  die  an  apoptotic  death  at  the  level  of  a  drug-imposed 
metabolic  stress  that  induces  apoptosis  in  some  but  not  in  most 
of  the  organism’s  normal  cells  (3). 

With  some  cancers,  cytotoxic  therapies  are  ineffective  from  the 
beginning.  In  other  cases,  these  therapies  yield  a  partial,  some¬ 
times  clinically  complete,  but  almost  invariably  transient  remis¬ 
sion  of  a  cancer,  in  part  because  these  treatments  select  for  tumor 
cell  variants  that  retain  tumorigenicity  but  are  more  resistant  to 
either  apoptosis  per  yc  or  a  drug  that  induces  apoptosis.  Because 
a  significant  increase  in  the  drug  or  radiation  dosage  is  precluded 
by  their  low  therapeutic  index,  these  therapies  become  ineffective 
when  resistant  clones  of  malignant  cells,  selected  by  a  drug 
treatment,  present  themselves  as  a  cancer  recurrence. 

The  failure  of  small  cytotoxic  drugs  to  produce  a  cure  for 
cancer  has  given  rise  to  other  strategies,  in  particular  the  insightful 
suggestion  that  solid  tumors  can  be  targeted  by  selectively  inhib¬ 
iting  neovascularization,  a  process  that  these  tumors  depend  on 
for  growing  to  a  clinically  significant  size  (4).  Another  approach, 
immunotoxins,  involves  the  linking  of  a  toxin  to  a  ligand  such  as 
an  antibody  or  a  growth  factor  that  binds  to  a  target  on  the  surface 
of  tumor  cells  (5).  Among  the  limitations  of  present-day  immu¬ 
notoxins  is  their  incapacity,  on  entering  a  cell,  to  adjust  their 
toxicity  in  response  to  the  intracellular  protein  composition.  Yet 
another  approach  is  to  enhance  the  ability  of  the  immune  system 
to  identify  and  selectively  destroy  tumor  cells.  The  current  revival 
of  this  strategy  holds  the  promise  of  a  rational  and  curative 
treatment  (6).  Given  the  complicated  regimens  and  unsolved 
problems  of  immunotherapies,  it  is  clear  that  this  and  other  recent 
approaches  (7,  8)  are  motivated  in  part  by  the  perception  that 
small-drug  pharmacology,  so  successful  against  bacterial  infec¬ 
tions,  is  unlikely  to  prove  effective  against  cancer.  In  contrast  to 
this  view,  the  premise  of  the  strategy  described  in  the  present 
work  is  that  small  anticancer  drugs  may  become  curative  and  free 
of  severe  side  effects  if  a  way  is  found  to  confer  on  these 
compounds  a  multitarget,  combinatorial  selectivity. 

Most  cancers  are  monoclonal:  cell  lineages  of  both  the  primary 
tumor  and  the  metastases  originate  from  a  single  founder  cell. 
This  cell  is  a  breakthrough  descendant  of  a  cell  lineage  that  has 
been  accumulating  mutations  for  some  time,  often  in  proximity 
to  other  neoplastic  but  still  nonmetastatic  cell  lineages  within  an 
indolent  proliferative  lesion  such  as  a  benign  tumor  (9, 10).  Given 
the  monoclonality  of  a  cancer,  cells  of  both  the  primary  tumor  and 
the  metastases  share  the  initial  mutations  that  yielded  the 
founder  cell,  even  if  these  cells  differ  at  other  loci  that  accumu¬ 
lated  mutations  in  the  course  of  the  later  tumor  progression. 
Some  of  the  early  mutations  are  in  genes  that  encode  tumor 
suppressors  (9, 11, 12).  In  most  cancers,  both  alleles  of  a  tumor 
suppressor  gene  are  inactivated,  sometimes  through  deletions 
that  encompass  the  gene  on  the  two  homologous  chromosomes. 
Thus,  a  monoclonal  cancer,  although  heterogeneous  genetically. 
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always  contains  a  set  of  founder  mutations  that  is  shared  by  all  of 
its  cells. 

A  drug  that  kills  a  cell  only  if  it  lacks  a  specific  macromo- 
lecular  target  would  distinguish  tumor  cells  from  many  other 
cells  of  an  organism,  provided  that  the  target  is  a  product  of 
a  gene  that  had  been  deleted  or  inactivated  in  this  cancer  at  the 
stage  of  its  founder  cell.  Such  a  drug  may  be  especially  selective 
against  cancers  that  lack  a  gene  for  a  ubiquitously  expressed 
tumor  suppressor,  for  example,  the  retinoblastoma  (Rb)  pro¬ 
tein  (11, 12).  An  example  of  the  negative-target  approach  is  the 
use  of  a  mutant  adenovirus  that  replicates  selectively  in  human 
cancer  cells  lacking  the  tumor  suppressor  p53  and  has  been 
shown  to  kill  these  cells  in  a  model  setting  (8), 

However,  other  tumor  suppressors  may  not  be  expressed  at 
comparable  levels  in  most  cells.  A  drug  that  kills  a  cell  if  it  lacks 
a  nonubiquitous  tumor  suppressor  would  be  toxic  to  a  subset 
of  normal  cells  as  well.  This  problem  could  be  reduced  through 
the  use  of  a  drug  that  is  toxic  only  if  a  cell  lacks  two  specific 
macromolecules,  termed  negative  targets.  Two  judiciously 
chosen  negative  targets  may,  together,  suffice  to  distinguish  all 
of  the  cancer  cells  from  all  of  the  organism’s  normal  cells.  If 
they  do  not,  a  third  negative  target  that  had  been  deleted  or 
rendered  defective  in  a  given  cancer  can  be  employed  as  well. 
This  strategy  requires  a  drug  that  possesses  the  ability  to  kill 
a  cell  if  it  lacks  two  or  more  of  the  predetermined  targets,  but 
would  spare  a  cell  containing  either  one  of  these  targets. 

Other  changes  in  a  founder  cell  may  involve  a  missense 
mutation,  an  amplification  and  overexpression,  an  ectopic 
expression,  or  a  translocation/fusion  of  a  specific  protoonco¬ 
gene  such  as,  for  example,  Ras  or  Myc  (9,  10,  13).  A  single 
oncoprotein  may  not  be  a  unique  enough  target  by  itself,  for 
reasons  similar  to  those  described  above  in  the  context  of 
negative  targets.  However,  a  combination  of  two  or  more 
distinct  oncoproteins  that  were  either  mutated  or  inappropri¬ 
ately  expressed  in  the  founder  cell  can  be  employed  to 
formulate  the  unique  multiprotein  signature  of  a  specific 
cancer  that  comprises  both  positive  and  negative  targets. 

These  considerations  suggest  that  a  conditionally  cytotoxic 
therapeutic  regimen  that  is  exquisitely  specific  for  a  given 
cancer,  and  therefore  would  eliminate  it  without  significant 
side  effects,  must  possess,  in  most  cases,  a  multitarget,  com¬ 
binatorial  (positive/negative)  selectivity  of  the  kind  defined 
above.  Conversely,  even  an  informed  choice  of  the  molecular 
target  for  a  single-target  drug  may  not  suffice  to  define 
unambiguously  the  cell  type  to  be  eliminated.  Note  that  simply 
combining  two  single-target  drugs  against  two  different  targets 
in  a  multidrug  regimen  would  not  yield  a  multitarget  selectiv¬ 
ity,  because  the  two  drugs  together  would  perturb  not  only  cells 
containing  both  targets  but  also  cells  containing  either  one  of 
the  targets. 

Although  the  problem  of  insufficient  selectivity  is  not  as 
acute  with  noncytotoxic  drugs,  it  is  relevant  to  them  as  well. 
Among  the  multitude  of  examples  are  side  effects  of  therapies 
with  antipsychotic  agents.  The  side  effects  are  caused  in  part 
by  the  insufficient  molecular  specificity  of  drugs,  which  is 
exemplified  by  the  ability  of  antidepressants  that  inhibit  mono¬ 
amine  oxidase  to  perturb  other  proteins  as  well  (2).  This 
difficulty  will  continue  to  abate  with  the  development  of  more 
specific  single-target  inhibitors.  But  an  entirely  distinct,  major, 
and  unsolved  problem  with  inhibitors  as  drugs  is  the  current 
impossibility  of  restricting  their  action  to  a  specific  subset  of 
cells  among  those  that  contain  the  inhibitor’s  target.  For 
example,  even  an  exquisitely  specific  inhibitor  of  a  clinically 
relevant  enzyme  is  likely  to  have  side  effects,  because  the 
target  enzyme  is  present,  in  most  cases,  not  only  in  the  cells 
where  its  inhibition  is  clinically  beneficial  but  also  in  the  cells 
where  its  inhibition  is  physiologically  inappropriate.  The 
present  work  describes  a  possible  solution  of  this  problem. 

A  previously  proposed  approach  to  designing  multitarget  drugs 
utilized  degradation  signals  (degrons)  and  analogous  signals  that 


exhibit  the  property  of  codominance  (14,  15).  As  a  result,  this 
strategy  was  confined  to  macromolecular  reagents  such  as  pro¬ 
teins,  with  the  attendant  problems  of  immunogen icity,  extrava¬ 
sation,  and  intracellular  delivery.  The  latter  difficulty  is  especially 
significant,  because  either  gene-therapy  or  direct-delivery  meth¬ 
ods  for  introducing  large  molecules  into  cells  work  reasonably 
well  with  cells  in  culture  but  are  still  inefficient  with  cells  in  an 
intact  organism.  The  challenge,  then,  is  to  attain  a  multitarget, 
combinatorial  selectivity  in  the  setting  of  small  (:^1  kDa)  drugs, 
where  the  immunogenicity  and  delivery  problems  are  less  severe. 
A  solution,  described  below,  invokes  a  modification  of  the  earlier 
idea  of  codominant  interference  (14)  in  conjunction  with  the  new 
concept  of  antieffectors.  This  solution  is  applicable  to  either 
cytotoxic  or  noncytotoxic  therapies. 

RESULTS  AND  DISCUSSION 

Multitarget  Compounds  Specific  for  Negative  Targets:  The 
Concept  of  Codominant  Interference.  Previous  work  (14) 
suggested  that  the  property  of  codominance,  characteristic  of 
degradation  signals  (degrons)  and  many  other  signals  in 
biopolymers,  can  be  employed  to  design  protein-based  re¬ 
agents  that  possess  multitarget,  combinatorial  selectivity  of  the 
kind  defined  above.  Codominance  refers  to  the  ability  of  two 
or  more  signals  in  the  same  molecule  to  function  indepen¬ 
dently  and  not  to  interfere  with  each  other.  It  is  shown  below 
that  a  distinct  version  of  the  interference/codominance  (IC) 
concept  (14)  is  applicable  to  small  (=£1  kDa)  compounds. 
Consider  a  reagent  containing  three  small  moieties  a,  b,  and 
i,  which  can  bind,  respectively,  to  three  macromolecular  targets 
A,  B,  and  1.  Because  the  moieties  a,  b,  and  i  are  much  smaller 
than  the  macromolecules  A,  B,  and  I,  it  should  be  possible  to 
arrange  these  moieties  in  the  compound  abi  in  such  a  way  that 
the  binding  of  A  or  B  to  a  or  b  would  preclude,  through  steric 
hindrance,  the  binding  of  moiety  i  to  I  (Fig.  1). 

That  the  interactions  of  a  small  bipartite  compound  with  its 
two  macromolecular  ligands  can  be  made  mutually  exclusive  is 
expected  from  basic  physicochemical  considerations.  This  has 
also  been  demonstrated  directly,  in  a  context  unrelated  to  the 
present  discussion.  When  lisinopril,  an  inhibitor  of  the  angio¬ 
tensin-converting  enzyme  (ACE),  was  connected,  via  an  11- 
atom  linker,  to  the  biotin  moiety,  the  resulting  bivalent  com¬ 
pound  could  bind  to  and  inhibit  ACE  in  the  absence  but  not 
in  the  presence  of  the  biotin-binding  protein  streptavidin  (16). 
Small  compounds  comprising  two  linker-connected  moieties 
such  as  cyclosporin  and  FK506,  which  are  specific  for  two 
macromolecular  targets,  have  previously  been  employed  as  in 
vivo  dimerization  devices,  making  it  possible  to  bring  together 
two  otherwise  noninteracting  proteins  (17).  However,  the 
linker  moiety  of  these  bipartite  compounds  was  chosen  to  allow 
simultaneous  interactions  with  the  targets,  in  contrast  to  the 
mutual  exclusivity  of  interactions  in  the  IC  approach  (Fig.  1). 

If  the  moiety  i  is  an  inhibitor  of  an  essential  cellular  enzyme  I, 
the  presence  of  the  macromolecular  targets  A  or  B  in  a  cell  would 
reduce  the  inhibition  of  enzyme  I  by  abi,  because  the  complexes 
abi-A  and  abi-B  would  be  mutually  exclusive  with  the  complex 
abi-I  (Fig.  1).  Note  that  A  and  B  are  codominant  in  their  ability 
to  reduce  the  inhibition  of  I  by  abi.  Therefore,  there  is,  formally, 
no  limit  on  the  number  of  a,  b-like  competition  modules  that  can 
be  used  to  construct  an  abl-like  compound  whose  activity  is 
sensitive  to  the  presence  of  several  distinct  macromolecules, 
called  negative  targets.  The  fractional  occupancy  of  the  macro¬ 
molecular  targets  A,  B,  and  I  by  the  a,  b,  and  i  moieties  of  abi 
would  be  determined  in  part  by  the  targets’  intracellular  concen¬ 
trations.  There  are  also  specific  pharmacokinetic  constraints  on 
the  selectivity  of  abi,  an  issue  discussed  below. 

A  tabulation  of  the  relative  toxicities  of  abi  for  cells  that  either 
lack  or  contain  the  negative  targets  A  and  B  is  shown  in  Fig.  2.  It 
can  be  seen  that  abi  would  be  relatively  nontoxic  to  three  of  the 
four  cell  types  and  toxic  exclusively  to  the  cells  that  lack  both  A 
and  B  (Fig.  2).  Thus,  the  IC  concept  allows  the  construction  of 
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Fig.  1.  The  interfcrence/codomi- 
nance  concept.  (1-4)  A  small  (<1 
kDa)  moiety  i  is  linked  to  two  other 
small  moieties,  a  and  b.  The  moieties 
a,  b,  and  i  are  ligands  of  the  macro- 
molecules  A,  B,  and  I,  respectively. 
The  distances  between  a,  b,  and  i,  and 
their  mutual  arrangement  in  the  tri¬ 
partite  compound  abi  are  such  that 
the  interaction  i-I  is  mutually  exclu¬ 
sive  with  either  the  interaction  a-A  or 
the  interaction  b-B.  Specifically,  the  macromolecule  I  in  its  complex  with  the  small  moiety  i  would  sterically  clash  with  the  macromolecules  A  or 
B  if  either  A  or  B  is  positioned  to  bind  a  or  b  of  abi  (5  and  6),  In  the  diagram,  the  interactions  a-A  and  b-B  are  also  mutually  exclusive,  but  this 
constraint  is  not  essential.  (Note  that  if  the  interactions  a-A  and  b-B  were  mutually  nonexclusive,  the  compound  abi  would  promote  the  binding 
of  A  to  B.)  The  codominance  aspect  of  the  IC  concept  allows  this  design  to  accommodate  more  than  two  of  the  a,  b-like  competition  modules  (not 
shown).  In  Figs.  2-4,  the  i  moiety  is  an  inhibitor  of  an  essential  enzyme  I.  In  fact,  the  only  constraint  on  the  identities  of  i  and  I  is  the  requirement 
for  an  i-I  interaction  to  alter  the  functional  activity  of  a  macromolecule  I.  In  other  words,  the  choice  of  I  is  determined  by  the  intended  effect  of 
the  (unsequestered)  compound  abi  (see  the  main  text). 

small  compounds  that  exhibit  multitarget  selectivity  for  negative 
targets.  One  more  idea  is  required  to  accomplish  the  same  for 
positive  targets  and  to  link  the  two  strategies. 

Multitarget  Compounds  Specific  for  Positive  Targets:  The 
Concept  of  Antieffectors.  Consider  a  small  compound  i*  that 
binds  to  enzyme  I  in  the  vicinity  of  its  active  site,  but  does  not 
perturb  the  catalytic  activity  of  I  toward  its  physiological 
substrates  (Fig.  3B).  Suppose  further  that  the  compound  i*, 
termed  an  antiinhibitor,  was  designed  to  interfere,  sterically, 
with  the  binding  of  an  inhibitor  i  to  the  enzyme’s  active  site 
while  at  the  same  time  allowing  the  binding  of  physiological 
substrates.  One  way  to  achieve  this  would  be  to  endow  either 
i,  or  !♦,  or  both  of  them  with  a  set  of  chemical  groups,  termed 
a  ‘‘bump,”  whose  function  is  to  produce  steric  hindrance  that 
makes  the  interactions  i-I  and  i*-I  mutually  exclusive  (Fig.  3  A 
and  B).  A  moiety  that  functions  as  a  bump  may  also  be 
designed  to  enhance  specific  binding  of  either  the  inhibitor  i  or 
the  antiinhibitor  to  enzyme  I,  but  this  consideration  is 
secondary  to  the  bump’s  essential  purpose. 

In  one  application  of  the  antiinhibitor  i*,  it  is  linked  to  c,  a 
small  moiety  that  can  bind  to  a  macromolecular  target  C.  The 
mutual  arrangement  of  !♦  and  c  in  ci*  is  such  that  the 
interactions  of  d*  with  I  and  C  are  mutually  exclusive.  In  the 
absence  of  C,  cl*  would  compete  with  abi  for  the  binding  to 


Fig.  2.  Multitarget  selectivity  of  a 
compound  that  utilizes  interference/ 
codominance.  This  diagram  tabulates 
the  relative  toxicities  of  the  compound 
abi  for  cells  that  either  lack  or  contain 
macromolecular  targets  A  and  B.  The  i 
moiety  of  the  compound  abi  (see  the 
legend  to  Fig.  1)  inhibits  an  essential 
enzyme  I.  The  interaction  i-1  is  mutu¬ 
ally  exclusive  with  the  interaction  a-A 
and  the  interaction  b-B,  the  macromol¬ 
ecules  A  and  B  being  negative  targets  of 
abi.  It  is  assumed  that  concentrations 
of  the  targets  A  and  B  in  cells  that 
contain  at  least  one  of  them  signifi¬ 
cantly  exceed  the  concentration  of  I 
(see  the  main  text).  In  A"^  A"^  B“, 

and  A“  cells,  the  enzyme  I  would 
be  at  most  partially  inhibited  by  the  i 
moiety  of  abi,  because  of  the  compet¬ 
ing  interactions  of  abi  with  A  and/or  B. 
By  contrast,  in  A"  B"  cells,  the  bulk  of 
abi  molecules  would  be  available  for 
interaction  with  I,  resulting  in  the  se¬ 
lective  toxicity  of  abi  to  these  cells.  The 
selectivity  pattern  of  abi  requires  that 
certain  pharmacokinetic  conditions 
are  met  as  well  (see  the  main  text). 
Note  that  the  physiological  effects  and 
the  uses  of  abi-type  compounds  are  not 
confined  to  cytotoxic  regimens. 
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enzyme  I,  thereby  partially  protecting  I  from  inhibition  by  abi 
(Fig.  3C).  This  protective  effect  of  ci^  would  be  suppressed  in 
the  presence  of  its  macromolecular  target  C  (Fig.  3D).  In  the 
logic  of  codominance,  discussed  above  in  the  context  of 
negative  targets,  a  compound  bearing  an  antiinhibitor  moiety 
I*  could  contain  more  than  one  c-like  moiety.  For  example,  a 
compound  cdi*,  whose  moieties  c  and  d  can  bind,  respectively, 
to  the  macromolecules  C  and  D,  would  reduce  the  inhibition 
of  enzyme  I  by  abi  only  in  the  absence  of  both  C  and  D.  Yet 
another  pattern  of  multitarget  selectivity  can  be  produced,  in 
this  context,  by  separating  the  competition  moieties  c  and  d. 
The  resulting  ci*  and  di*,  if  administered  together  with  abi, 
would  reduce  the  inhibition  of  enzyme  I  by  abi  if  just  one  of 
the  targets,  C  or  D,  is  absent.  As  shown  below,  the  key  merit 
of  the  antiinhibitor  idea  is  that  it  allows  the  effect  of  a  single 
inhibitor  i  to  be  modulated  by  both  negative  and  positive 
macromolecular  targets. 

On  the  Difference  Between  Antieffectors  and  Antagonists. 
The  distinctions  between  substrates,  inhibitors,  and  antiinhibi¬ 
tors  were  described  above.  The  concept  of  antieffectors  is  also 
relevant  to  ligand-binding  biopolymers  other  than  enzymes. 
For  example,  an  agonist  binds  to  its  receptor  and  evokes  a 
physiological  response.  An  antagonist  binds  to  a  site  of  the 
receptor  that  overlaps  with  the  agonist-binding  site,  does  not 
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Fig.  3.  The  antieffector  concept.  The  particular  case  illustrated 
here  is  that  of  an  antiinhibitor  I*,  defined  as  a  compound  whose 
binding  to  an  enzyme  1  does  not  inhibit  the  activity  of  I  but  does 
preclude  the  inhibition  of  I  by  an  inhibitor  i.  In  this  example,  the 
antiinhibitor  i*  has  the  following  properties.  First,  it  binds  to  I  in  the 
vicinity  of  the  I*s  active  site,  but  does  not  perturb  the  catalytic  activity 
of  I  toward  its  physiological  substrates.  Second,  i*,  in  its  bound  state, 
sterically  interferes  with  the  interaction  between  I  and  its  inhibitor  i. 
To  implement  the  second  condition,  either  i,  or  !♦,  or  both  of  them 
bear  additional  moiety,  a  “bump,”  denoted  by  the  rectangular  pro¬ 
trusions  in  i  and  i*.  The  function  of  the  bump  is  to  produce  steric 
hindrance  that  makes  the  interactions  i-I  and  IM  mutually  exclusive. 
The  inhibitor  i  described  here  and  in  the  main  text  is  a  competitive 
inhibitor,  but  i  could  be  a  noncompetitive  inhibitor  as  well.  An 
allosteric  antiinhibitor,  which  functions  through  binding  to  a  remote 
site  of  enzyme  I,  is  yet  another  possibility.  (^)  A  complex  of  the 
enzyme  I  with  its  inhibitor  i.  (B)  A  complex  of  I  with  its  antiinhibitor 
I*.  Note  that  the  bumps  of  the  bound  i  and  !*  spatially  overlap.  (C)  The 
antiinhibitor  P  is  linked  to  c,  a  small  moiety  that  can  bind  to  a 
macromolecular  target  C.  The  design  of  cl*  is  analogous  to  abi  (Figs. 
1  and  2),  in  that  the  interactions  of  ci*  with  1  and  C  are  mutually 
exclusive.  When  cl*  is  bound  to  I,  the  inhibitor  I,  shown  here  as  a  part 
of  the  compound  abl  (Figs.  1  and  2),  is  unable  to  bind  to  and  inhibit 
the  enzyme  I.  (D)  A  complex  between  cl*  and  its  macromolecular 
target  C.  This  complex,  being  mutually  exclusive  with  the  cl*-I 
complex,  reduces  the  ability  of  cl*  to  protect  the  enzyme  I  from 
inhibition  by  abl. 

activate  the  receptor,  and  in  addition  precludes  the  binding  of 
agonist  (1,  2).  By  contrast,  an  antieffector,  which  would  be 
called,  in  this  setting,  an  antiantagonist,  binds  to  the  receptor 
in  such  a  way  that  the  receptor  can  still  bind,  and  respond  to, 
the  agonist,  but  cannot  bind  the  antagonist.  To  this  end,  either 
an  antagonist,  or  an  antiantagonist,  or  both  must  possess  a 
bump,  an  additional  moiety  described  above  in  the  context  of 


enzymes  and  antiinhibitors  (Fig.  3).  The  idea  of  antieffectors 
is  thus  distinct  from  that  of  antagonists  or  inhibitors  and  is  new, 
to  the  best  of  my  knowledge.  I  am  also  not  aware  of  a  naturally 
occurring  pair  of  compounds  that  satisfy  the  definition  of 
effectors/antieffectors  in  a  physiologically  relevant  setting. 

Interference/Codominance  and  Antieffector  in  a  Regimen 
That  Possesses  Combinatorial  Selectivity.  Applying  the  IC  and 
antieffector  concepts  together  yields  regimens  that  possess  true 
combinatorial  selectivity,  i.e.,  sensitivity  to  both  negative  and 
positive  targets.  Consider  a  population  of  cells  that  either  contain 
or  lack  the  macromolecular  targets  A,  B,  and  C.  Our  aim  is  to 
devise  a  treatment  that  would  be  toxic  to  cells  that  lack  A  and  B 
but  contain  C  (A“  B”  cells)  and  relatively  nontoxic  to  the 
other  cell  types  (Fig.  4).  A  regimen  of  two  compounds,  abi  (Fig. 
3C)  and  ci*  (Fig.  3D),  has  the  requisite  selectivity,  as  shown  in  Fig. 
4,  which  tabulates  the  outcomes  of  this  treatment  for  different  cell 
types.  Specifically,  in  the  absence  of  C  (four  cell  types  out  of 
eight),  the  antiinhibitor-containing  cl*  would  compete  with  the 
inhibitor-containing  abi  for  binding  to  the  essential  enzyme  I, 
thereby  reducing  the  inhibition  of  I  by  abi,  and  hence  reducing  the 
toxicity  of  abi.  In  three  other  cell  types,  whose  common  property 
is  the  presence  of  C  and  at  least  one  of  the  other  two  targets,  A 
or  B,  the  antiinhibitor-containing  ci*  would  be  largely  seques¬ 
tered  by  C,  and  hence  inactive,  but  the  inhibitor-containing  abi 
would  be  sequestered  as  well,  by  either  A  or  B.  In  only  one  type 
of  cells,  those  that  lack  A  and  B  but  contain  C  (A”  B‘  cells), 
is  the  inhibitor  abi  fiilly  available  for  interaction  with  I,  resulting 
in  higher  toxicity  (Fig.  4).  The  differences  in  the  toxicity  of  abl  to 
different  cell  types  would  be  determined  by  the  relative  stoichio¬ 
metries  and  absolute  concentrations  of  the  cellular  targets  in¬ 
volved  (A,  B,  C,  and  I),  by  the  affinities  of  the  moieties  a,  b,  c,  i, 
and  i*  for  these  targets,  and  by  the  pharmacokinetic  properties  of 
abi  and  ci*. 

Straightforward  variations  of  the  abl  and  ci*  designs  that 
utilize  IC  and  the  properties  of  antiinhibitors  would  allow 
selective  targeting  of  any  one  of  the  eight  cell  types  that  differ 
by  the  presence  or  absence  of  three  macromolecular  targets. 
Moreover,  there  is  no  formal  limit  on  the  total  number  of 
negative  and/or  positive  targets  that  can  be  simultaneously 
sensed  by  regimens  that  employ  abi-  and  ci*-type  compounds 
bearing  multiple  interference  moieties.  Note  that  the  cell  type 
selectivity  of  regimens  such  as  abi  +  ci*  (Fig.  4)  is  analogous 
to  the  outcomes  of  the  Boolean  operations  “and,”  “or,”  and 
combinations  thereof. 

Stoichiometries,  Affinities,  and  Pharmacokinetics.  The  se¬ 
lectivity  of  the  proposed  compounds  results  from  mutually 
exclusive,  competing  interactions  between  individual  moieties 
of  these  compounds  and  their  macromolecular  targets  (Figs. 
2-4);  hence,  the  importance  of  the  targets’  intracellular  con¬ 
centrations,  relative  to  each  other  and  the  enzyme  I,  which  is 
inhibited  by  an  effector  moiety  of  these  drugs.  The  choice  of 
I  is  not  confined  to  essential  enzymes.  The  target  1  could  be, 
for  instance,  a  DNA-binding  repressor  of  terminal  differenti¬ 
ation,  a  repressor  of  apoptosis,  or,  in  the  example  of  a 
noncytotoxic  therapy,  a  neurotransmitter-inactivating  enzyme. 
In  other  words,  the  choice  of  I  is  determined  by  the  intended 
effect  of  the  (unsequestered)  compound  abi. 

The  sequestration  of  abi-  and  ci*-type  compounds  by  their 
macromolecular  ligands  A,  B,  and  C  serves  to  prevent  their 
binding  to  the  enzyme  I  (Figs.  3  and  4).  Therefore,  in  schemes 
of  the  type  considered  above,  the  molar  concentration  of  I 
should  be  significantly  (if  possible,  considerably)  lower  than 
the  molar  concentrations  of  A,  B,  and  C.  In  addition,  the 
concentration  of  ci*  in  a  ci*  +  abi  regimen  should  significantly 
exceed  that  of  abi,  because  ci*  is  the  sole  obstacle  to  the 
inhibition  of  enzyme  I  by  abi  in  the  A*"  B“  C“  cells  (Fig.  4). 
It  is  assumed,  furthermore,  that  the  total  intracellular  con¬ 
centrations  of  abi  and  ci*,  bound  and  unbound,  would  remain 
significantly  below  the  concentrations  of  the  interference 
targets  A,  B,  and  C.  The  affinities  of  A,  B,  C,  and  I  for  the 
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Fig.  4.  Combinatorial  (positive/ 
negative)  selectivity  of  a  regimen  that 
utilizes  interference/codominance 
and  antiinhibitor.  The  diagram  tabu¬ 
lates  the  relative  toxicities  of  the  com¬ 
pound  abi  in  the  setting  of  a  two- 
compound  treatment  of  cells  that  ei¬ 
ther  lack  or  contain  the 
macromolecular  targets  A,  B,  and  C. 
The  inhibitor-containing  compound 
abi  is  described  in  the  main  text  and 
Figs.  1,  2,  and  3C.  The  antiinhibitor- 
containing  compound  ci*  is  described 
in  the  main  text  and  Fig.  3.  A  and  B  are 
negative  targets,  in  that  they  reduce, 
through  the  binding  to  the  moieties  a 
and  b  of  abi,  the  inhibition  of  an 
essential  enzyme  1  by  abi.  C  is  a 
positive  target,  in  that  it  reduces, 
through  the  binding  to  the  c  moiety  of 
c|*,  the  binding  of  ci*  to  enzyme  I. 
This  results  in  a  larger  fraction  of 
enzyme  I  available  for  the  inhibition 
by  abi.  It  is  assumed  that  the  concen¬ 
trations  of  A,  B,  and  C  in  cells  that 
contain  them  significantly  exceed  the 
concentration  of  I  (see  the  main  text). 
In  all  of  the  cell  types  except  A" 

C'^,  the  enzyme  I  would  be  at  most 
partially  inhibited  by  the  i  moiety  of 
abi,  because  of  the  competing  inter¬ 
actions  of  abi  with  A  and/or  B,  and 
also  because  in  cells  a  fraction  of 
enzyme  1  would  be  protected  from  the 
inhibition  by  abi  through  the  interac¬ 
tion  of  1  with  the  antiinhibitor  moiety 
i*  of  ci*.  By  contrast,  in  A“  B“ 
cells,  the  antiinhibitor-containing  ci* 
would  be  sequestered  by  C,  whereas 
abi  would  not  be  sequestered  by  A  or 
B,  which  are  absent  from  these  cells. 
As  a  result,  a  larger  fraction  of  the  inhibitor-containing  abi  molecules  would  be  available  for  the  interaction  with  I,  resulting  in  the  selective  toxicity 
of  abi  to  A“  B“  cells.  The  selectivity  pattern  of  abi  requires  that  certain  pharmacokinetic  conditions  are  met  as  well  (sec  the  main  text).  Note 
that  the  physiological  effects  and  the  uses  of  abi-type  compounds  are  not  confined  to  cytotoxic  regimens. 
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respective  moieties  of  the  compounds  abi  and  ci*,  and  also  the 
targets’  intracellular  locations  are  among  the  independent 
parameters  that  can  be  varied  in  designing  these  compounds. 

Yet  another,  and  major,  constraint  on  the  nature  and 
pharmacokinetics  of  multitarget  drugs  stems  from  the  fact  that 
the  selectivity  patterns  described  above  (Figs.  2  and  4)  may  not 
be  observed  under  equilibrium  conditions,  where  the  influx  of 
a  drug  into  cells  equals  its  outflux.  To  illustrate  one  clear 
difficulty,  let  us  oversimplify  and  suppose  that  an  abi-type 
compound  is  metabolically  inert,  in  addition  to  being  capable 
of  crossing  the  plasma  membranes  and  other  lipid  bilayers.  If 
abi  is  initially  outside  the  cells,  and  if  the  extracellular  pool  of 
abi  (e.g.,  in  the  blood  plasma)  is  large  enough,  it  can  be  shown 
that  the  subsequently  reached  equilibrium  state  would  be 
characterized  by  equal  concentrations  of  the  free  abi  in  both 
the  A***  and  A“  B”  cells,  thereby  resulting  in  the  equal 
occupancies  of  the  enzyme  I  by  abi  in  these  cells,  contrary  to 
the  pattern  illustrated  in  Fig.  2. 

By  contrast,  the  selectivity  patterns  of  Figs.  2  and  4  would  be 
observed  during  the  initial  inHux  of  drugs  into  cells.  Thus,  one 
requirement  for  the  multitarget  selectivity  of  abi-  and  ci*-type 
regimens  is  the  avoidance  of  equilibrium  states  such  as  the  one 
described  above.  This  and  related  considerations  indicate  that 
despite  the  logical  simplicity  of  the  proposed  designs,  their 
implementation  will  have  to  address  pharmacokinetic  prob¬ 
lems  that  do  not  necessarily  arise  with  single-target  drugs. 

Selection  of  Targets  and  Construction  of  Multitarget  Drugs. 
Appropriate  macromolecular  targets  of  the  A-C  class  (Figs.  3  and 
4)  are  suggested  1^  the  protein  composition  of  the  tumor  cells  to 


be  eliminated.  The  choice  of  an  essential  intracellular  enzyme  I 
(Figs.  2-4)  is  determined  by  the  presence  of  I  at  least  in  tumor  cells, 
its  physiological  concentration,  and  the  feasibility  of  an  efficacious 
inhibitor  of  I.  Among  potentially  suitable  enzymes  for  which 
cell-penetrating  inhibitors  already  exist  is  dihydrofolate  reductase. 
Its  high-affinity  inhibitors  include  methotrexate,  which  enters  cells 
through  carrier-mediated  pathways,  and  the  more  lipophilic  tri- 
metrexate,  which  can  enter  cells  by  diffusing  through  lipid  bilayers 
(18). 

Each  of  the  modules  in  the  abi-  and  ci*-type  compounds  (Figs. 
1-4)  would  bind  its  macromolecular  target  in  the  absence  of  the 
other  modules.  Therefore,  the  cytotoxic  i-type  modules  of  IC- 
based  compounds  would  be  similar  to  the  stand-alone  cytotoxic 
drugs  of  today.  By  contrast,  the  interference  modules  of  these 
compounds,  i.e.,  their  a-,  b-,  and  c-type  moieties  (Figs.  1-3),  are 
supposed  to  bind  to  their  macromolecular  targets  but  preferably 
not  impair  them  functionally.  This  specification  of  a  competition 
module  simplifies  its  design  in  comparison  to  that  of  inhibitors, 
because  many  sites  on  the  target’s  surface,  and  not  just  the  active 
site,  would  be  acceptable. 

Antiinhibitors  (Fig.  3)  are  a  new  class  of  physiologically  active 
compounds.  The  opportunities  and  problems  of  their  design  are 
similar  to  those  for  the  interference  moieties  a-c  (Figs.  1-3),  but 
there  are  two  other  difficulties  as  well.  First,  the  antiinhibitor 
moiety  i*  must  bind  in  the  vicinity  of,  but  not  at,  the  active  site 
of  enzyme  I.  Second,  the  moiety  i*  must  also  bear  chemical 
groups  (a  bump)  whose  function  is  to  preclude,  through  steric 
hindrance,  the  binding  of  the  inhibitor  moiety  i  to  the  enzyme  I 
(Fig.  3). 
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A  substrate-binding  cleft  is  not  the  only  indentation  in  a 
folded  protein  molecule.  The  other  clefts  tend  to  be  smaller, 
but  they  are  present  as  well  (19),  and  some  of  them  may  be 
located  next  to  the  enzyme’s  active  site  (Fig.  3).  In  addition, 
even  relatively  flat  molecular  surfaces  can  be,  in  principle,  the 
sites  of  high-affinity  interactions  with  small  ligands  (20).  These 
optimistic  comments  notwithstanding,  the  development  of 
antiinhibitors  is  certain  to  be  a  complex  undertaking.  As 
discussed  above,  the  pharmacokinetic  aspects  of  the  proposed 
designs  are  also  complex.  Yet  simplicity  is  good  only  if  it  works. 
Single-target  anticancer  drugs  remain  unsatisfactory,  in  spite 
of  decades  of  immense  effort.  It  may  therefore  be  wise  to 
attempt  a  more  complex  but  also  more  effective  solution. 

A  recent  advance  in  drug  design,  termed  SAR  by  NMR 
(structure-activity  relationships  by  nuclear  magnetic  reso¬ 
nance),  provides  an  especially  promising  route  to  constructing 
ligands  for  specific  regions  of  a  protein  molecule  (21).  In  this 
approach,  a  library  of  small  molecules  is  screened  for  binding 
to  an  *^N-labeled  protein  by  using  NMR,  which  can  detect 
weak  interactions,  and  in  addition  assigns  them  to  specific 
nitrogens  of  a  protein,  thereby  identifying  the  site  of  binding. 
Finding  two  small  compounds  that  bind  to  adjacent  patches  of 
the  target  protein  molecule  and  covalently  linking  these  com¬ 
pounds  produces  a  higher-affinity  ligand.  This  powerful  strat¬ 
egy  (21),  which  already  yielded  tightly  binding  ligands  of 
specific  proteins,  may  prove  sufficient  for  constructing  the  a-c 
competition  modules  and  the  i*  antiinhibitor  modules  of  the 
proposed  designs  (Figs.  1-4). 

Noncytotoxic  Multitarget  Drugs.  Many  useful  drugs  are  the 
inhibitors  of  intracellular  enzymes  that  are  not  essential  for  cell 
viability  (2).  The  problem  of  insufficient  selectivity  is  relevant 
to  these  drugs  as  well.  For  example,  even  an  exquisitely  specific 
inhibitor  of  a  clinically  relevant  enzyme  is  likely  to  have 
significant  side  effects,  because  the  target  enzyme  is  present, 
in  most  cases,  not  only  in  the  cells  where  its  inhibition  is 
clinically  beneficial  but  also  in  the  cells  where  its  inhibition  is 
physiologically  inappropriate.  The  logic  of  abi-type  inhibitors 
(Figs.  1  and  2)  and  ci*-type  antiinhibitors  (Figs.  3  and  4)  is 
applicable  in  these  settings,  because  an  informed  choice  of  the 
competition  moieties  a,  b,  and  c  would  sharpen  up  the  cell 
selectivity  of  the  inhibitor  moiety  i  in  the  way  described  above 
for  cytotoxic  drugs  (Fig.  4),  resulting  in  the  inhibition  of  the 
(nonessential)  enzyme  I  in  a  predetermined  subset  of  the 
enzyme-containing  cells.  Note  that  the  same  considerations 
apply  to  extracellular  settings  as  well.  The  examples  above  are 
but  a  glimpse  of  the  drug-engineering  vistas  that  are  opened 
up  by  the  IC  and  antieffector  concepts.  At  the  same  time,  there 
are  significant  pharmacokinetic  constraints  on  the  properties 
of  the  proposed  drugs,  as  discussed  above.  These  constraints 
are  likely  to  complicate  the  implementation  of  the  IC/ 
antieffector  strategies. 

The  Problem  of  Drug  Resistance.  With  small  anticancer 
drugs  that  are  in  use  today,  the  macromolecular  target  of  a 
drug  serves  two  distinct  fonctions.  First,  the  target  is  a 
cell-selectivity  determinant  that  may  bias  the  treatment  against 
tumor  cells.  Second,  the  target  is  also  a  device  whose  inhibition 
by  the  drug  brings  about  the  desired  effect,  e.g.,  cell  death. 
Consequently,  when  drug-resistant  tumor  cells,  selected  by  a 
drug  treatment,  present  themselves  as  a  cancer  recurrence,  the 
necessity  of  employing  another  therapeutic  agent  (if  such  an 
option  exists)  robs  the  physician  of  whatever  cell-selectivity 
advantage  there  was  with  the  earlier  drug. 

The  situation  is  qualitatively  different  with  IC-based  com¬ 
pounds.  Suppose  that  a  treatment  that  included  the  drug  abi 
(Figs.  1  and  2)  results  in  the  appearance  of  abi-resistant  tumor 
cells  that  contain,  for  example,  an  altered  or  overproduced 
enzyme  I.  If  so,  replacement  of  the  i  moiety  by  another  small 
cytotoxic  moiety,  specific  for  another  essential  enzyme,  would 
retain  the  cell  selectivity  of  the  new  ab-containing  drug.  Thus, 
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one  advantage  of  modularity  inherent  in  the  designs  of  IC/ 
antiinhibitor-based  compounds  (Figs.  1-4)  lies  in  the  separa¬ 
tion  of  the  effector  aspect  of  a  drug  from  its  selectivity  aspect. 
As  a  result,  once  an  efficacious  arrangement  of  the  selectivity 
modules  in  abi-  or  ci*-type  compounds  has  been  identified,  it 
can  be  reutilized  in  drugs  bearing  effector  moieties  other  than 
i  and  i*. 

Concluding  Remarks.  The  above  considerations  are  based 
on  the  existing  understanding  of  single-target  drugs  and  on  the 
notion  of  steric  hindrance.  By  introducing  the  new  concept  of 
antieffectors  and  a  modification  of  the  previously  proposed 
idea  of  codominant  interference  (14),  we  can  now  attempt  the 
construction  of  small  modular  compounds  that  possess  a 
multitarget,  combinatorial  selectivity  (Fig.  4).  The  IC/ 
antieffector  strategies  are  not  confined  to  cytotoxic  therapies 
and  are  relevant,  in  principle,  to  all  pharmacological  settings. 
As  indicated  above,  one  expected  difficulty  in  implementing 
these  strategies  stems  from  significant  pharmacokinetic  con¬ 
straints  that  do  not  necessarily  arise  with  single-target  drugs. 

This  work  was  motivated  by  the  premise  that  the  confine¬ 
ment  of  anticancer  drug  research  and  development  to  single¬ 
target  compounds  will  prove  insufficient  for  the  task  at  hand, 
because  even  the  informed  choices  of  targets  for  such  drugs 
may  not  define  unambiguously  enough  the  cell  type  to  be 
eliminated.  The  remedy,  described  above,  is  to  aim  for  drugs 
that  possess  qualitatively  different  selectivity — multitarget  and 
combinatorial.  If  this  view  is  correct,  the  future  ascent  of 
multitarget  drugs  may  transform  not  only  the  treatment  of 
cancer  but  also  approaches  in  other  settings  where  the  killing 
or  modification  of  undesirable  cells  or  organelles  is  carried  out 
in  the  presence  of  nearly  identical  cells  or  organelles  that  must 
be  spared.  These  applications  of  multitarget  drugs  encompass 
more  discriminating  antiviral  and  antifungal  therapies,  as  well 
as  the  selective  killing  of  activated  lymphocytes  in  autoimmune 
diseases  and  the  selective  elimination  of  damaged  mitochon¬ 
dria  in  aging  cells  (14, 15).  In  yet  another  class  of  applications, 
a  noncytotoxic  multitarget  drug  would  be  used  to  inhibit  a 
clinically  relevant  nonessential  enzyme  in  a  specific  subset  of 
the  enzyme-containing  cells,  thereby  retaining  the  benefits  of 
inhibition  while  reducing  its  side  effects. 
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The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-terminal  residue.  The 
underlying  ubiquitin-dependent  proteolytic  system,  called  the  N-end  rule  pathway,  is  organized  hierarchically: 
N-terminal  aspartate  and  glutamate  (and  also  cysteine  in  metazoans)  are  secondary  destabilizing  residues,  in 
that  they  function  through  their  conjugation,  by  arginyl-tRNA-protein  transferase  (R-transferase),  to  arginine, 
a  primary  destabilizing  residue.  We  isolated  cDNA  encoding  the  516-resldue  mouse  R-transferase,  ATElp,  and 
found  two  species,  termedy4/ei-7  mdiAtel-2.  Ih^Atel  mRNAs  are  produced  through  a  most  unusual  alternative 
splicing  that  retains  one  or  the  other  of  the  two  homologous  129-bp  exons,  which  are  adjacent  in  the  mouse  Ate  1 
gene.  Uumun  ATEl  also  contains  the  alternative  129-bp  exons,  whereas  the  plant  {Arabidopsis  thaliana)  and  fly 
(Drosophila  melanogaster)  Atel  genes  encode  a  single  form  of  ATElp.  A  fusion  of  ATEl-lp  with  green  fluorescent 
protein  (GFP)  is  present  in  both  the  nucleus  and  the  cytosol,  whereas  ATEl-2p-GFP  is  exclusively  cytosolic. 
Mouse  ATEl-lp  and  ATEl-2p  were  examined  by  expressing  them  in  fl/eJA  Saccharomyces  cerevisiae  in  the 
presence  of  test  substrates  that  included  Asp-Pgal  (P-galactosidase)  and  Cys-Pgal.  Both  forms  of  the  mouse 
R-transferase  conferred  instability  on  Asp-pgal  (but  not  on  Cys-Pgal)  through  the  arginylation  of  its  N- 
terminal  Asp,  the  ATEl-lp  enzyme  being  more  active  than  ATEl-2p.  The  ratio  oIAteDl  to  AteD2  mRNA  varies 
greatly  among  the  mouse  tissues;  it  is  ~0.1  in  the  skeletal  muscle,  *^0.25  in  the  spleen,  ~3.3  in  the  liver  and 
brain,  and  '-lO  in  the  testis,  suggesting  that  the  two  R-transferases  are  functionally  distinct. 


The  half-lives  of  intracellular  proteins  range  from  a  few 
seconds  to  many  days.  The  rates  of  processive  proteolysis  are  a 
function  of  the  cell’s  physiological  state  and  are  controlled 
differentially  for  specific  proteins.  In  particular,  most  of  the 
damaged  or  otherwise  abnormal  proteins  are  metabolically 
unstable.  Many  other  proteins,  while  long-lived  as  components 
of  larger  macromolecular  structures  such  as  ribosomes  and 
oligomeric  proteins,  are  metabolically  unstable  as  free  sub¬ 
units.  Regulatory  proteins  are  often  also  short-lived  in  vivo, 
providing  a  way  to  generate  their  spatial  gradients  and  to 
rapidly  adjust  their  concentrations,  or  subunit  compositions, 
through  changes  in  the  rate  of  their  synthesis  or  degradation 
(20,  23,  28,  39,  44,  55). 

The  posttranslational  conjugation  of  arginine  (Arg)  to  the  N 
termini  of  eukaryotic  proteins  was  described  35  years  ago  (26), 
but  the  function  of  this  modification,  and  of  the  enzyme  in¬ 
volved,  Arg-tRNA-protein  transferase  (R-transferase)  (47),  re¬ 
mained  unknown  until  the  discovery  that  the  identity  of  N- 
terminal  residue  in  a  protein  influences  its  metabolic  stability 
(4).  The  resulting  relation  was  termed  the  N-end  rule  (54). 
Aspartate  (Asp)  and  glutamate  (Glu),  the  two  N-terminal  res¬ 
idues  known  to  be  arginylated  by  R-transferase  (47),  were 
shown  to  be  destabilizing  residues  in  the  N-end  rule  (4).  It  was 
therefore  proposed  (4)  that  the  function  of  R-transferase  is  to 
target  proteins  for  degradation  by  conjugating  Arg,  one  of  the 
primary  destabilizing  residues,  to  secondary  destabilizing  N- 
terminal  residues  (Asp  and  Glu  in  fungi;  Asp,  Glu,  and  Cys  in 
metazoans  (18)  (Fig.  1).  It  was  also  proposed  (4)  that  the 
analogous  prokaryotic  enzyme  Leu,  Phe-tRNA-protein  trans¬ 
ferase  (L,  F-transferase)  (47)  mediates  the  activity  of  N-termi- 
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nal  Arg  and  Lys,  which,  in  prokaryotes,  would  be  the  secondary 
destabilizing  residues.  These  conjectures  were  confirmed  (7, 
17,  45,  53). 

The  similar  but  distinct  degradation  signals  which  together 
give  rise  to  the  N-end  rule  are  called  the  N-degrons  (54,  56).  In 
eukaryotes,  an  N-degron  comprises  two  determinants:  a  desta¬ 
bilizing  N-terminal  residue  and  an  internal  Lys  residue  of  a 
substrate  (5,  22).  The  Lys  residue  is  the  site  of  formation  of  a 
substrate-linked  multiubiquitin  chain  (11).  The  N-end  rule 
pathway  is  thus  one  pathway  of  the  ubiquitin  (Ub)  system.  Ub 
is  a  76-residue  protein  whose  covalent  conjugation  to  other 
proteins  plays  a  role  in  a  multitude  of  processes,  including  cell 
growth,  division,  differentiation,  and  responses  to  stress  (20, 
23, 39, 55).  In  many  of  these  processes,  Ub  acts  through  routes 
that  involve  the  degradation  of  Ub-protein  conjugates  by  the 
26S  proteasome,  an  ATP-dependent  multisubunit  protease  (9, 
13,  40,  43). 

(In  the  text  that  follows,  names  of  mouse  genes  are  in  italics, 
with  the  first  letter  uppercase.  Names  of  human  and  Saccha¬ 
romyces  cerevisiae  genes  are  also  in  italics,  all  uppercase.  If 
human  and  mouse  genes  are  named  in  the  same  sentence,  the 
mouse  gene  notation  is  used.  Names  of  S.  cerevisiae  proteins 
are  roman,  with  the  first  letter  uppercase  and  an  extra  lower¬ 
case  “p”  at  the  end.  Names  of  the  corresponding  mouse  and 
human  proteins  are  the  same,  except  that  all  letters  but  the  last 
“p”  are  uppercase.  The  latter  usage  is  a  modification  of  the 
existing  convention  (50),  to  facilitate  simultaneous  discussions 
of  yeast,  mouse,  and  human  proteins.  In  some  citations,  the 
abbreviated  name  of  a  species  precedes  the  gene’s  name.) 

The  N-end  rule  is  organized  hierarchically.  In  the  yeast  5. 
cerevisiae,  Asn  and  Gin  are  tertiary  destabilizing  N-terminal 
residues  in  that  they  function  through  their  conversion,  by  the 
A^7M7-encoded  N-terminal  amidase  (Nt-amidase)  (6),  to  the 
secondary  destabilizing  N-terminal  residues  Asp  and  Glu.  The 
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FIG.  1 .  Comparison  of  enzymatic  reactions  that  underlie  the  activity  of  the 
tertiary  and  secondary  destabilizing  residues  among  eukaryotes.  (A)  Mammals 
(reference  54  and  this  work):  (B)  the  yeast  5.  cerevisiae  (5).  N -terminal  residues 
are  indicated  by  single-letter  abbreviations  for  amino  acids;  ovals  denote  the  rest 
of  a  protein  substrate.  The  Man /-encoded  mammalian  Nt-amidase  converts 
N-terminal  Asn  to  Asp,  whereas  N-terminal  Gin  is  deamidated  by  a  distinct 
Nt-amida.se  that  remains  to  be  identified  (19,  51).  In  contrast,  the  yeast  Nt- 
amidase  can  deamidatc  either  N-terminal  Asn  or  Clin  (6).  The  secondary  desta¬ 
bilizing  residues  Asp  and  GIu  are  arginylated  by  the  mammalian  ATE  1 -Ip  or 
ATEl-2p  R-transferase  (see  Results).  A  Cys-spccific  mammalian  R-transferase 
remains  to  be  identified  (see  Results).  N-terminal  Arg,  one  of  the  primary 
destabilizing  residues  (54),  is  recognized  by  N-recognin  (E3)  (see  the  introduc¬ 
tion). 


destabilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation,  by  the  S.  cerevisiae  y4r£/-encoded  Arg-tRNA- 
protein  transferase  (R-transferase),  to  Arg,  one  of  the  primary 
destabilizing  residues  (7)  (Fig.  IB).  In  mammals,  the  deami¬ 
dation  step  is  bifurcated,  in  that  two  distinct  Nt-amidases  spe¬ 
cific,  respectively,  for  N-terminal  Asn  and  Gin,  mediate  the 
activity  of  tertiary  destabilizing  residues  (19,  51)  (Fig.  lA). 
Mice  lacking  the  Asn-specific  Nt-amidase  NTANlp  have  re¬ 
cently  been  produced  through  targeted  mutagenesis  and  found 
to  be  fertile,  outwardly  normal,  but  behaviorally  distinct  from 
their  congenic  wild-type  counterparts  (28a).  In  mammals,  the 
set  of  secondary  destabilizing  residues  contains  not  only  Asp 
and  Glu  but  also  Cys,  which  is  a  stabilizing  residue  in  yeast  (18, 
54)  (Fig.  1). 

The  primary  destabilizing  N-terminal  residues  are  bound 
directly  by  the  t/BR7-encoded  N-recognin  (also  called  E3a), 
the  recognition  component  of  the  N-end  rule  pathway  (8).  In  S. 
cerevisiae,  N-recognin  is  a  225-kDa  protein  that  binds  to  po¬ 
tential  N-end  rule  substrates  through  their  primary  destabiliz¬ 
ing  N-terminal  residues,  Phe,  Leu,  Tip,  Tyr,  lie,  Arg,  Lys,  and 
His  (54).  The  Ubrl  genes  encoding  mouse  and  human  N- 
recognins,  also  called  Ea  (21,  41),  have  been  cloned  (29),  and 
mouse  strains  lacking  Ubrl  have  recently  been  constructed 
(29a). 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  peptide  import  in  5.  cerevisiae,  through  the  degra¬ 
dation  of  Cup9p,  a  transcriptional  repressor  of  PTR2  which 
encodes  the  peptide  transporter  (2, 10);  a  role  in  regulating  the 
Slnlp-dependent  phosphorylation  cascade  that  mediates  os¬ 
moregulation  in  5.  cerevisiae  (38);  the  degradation  of  Gpalp,  a 
Ga  protein  of  S.  cerevisiae  (34);  and  the  conditional  degrada¬ 
tion  of  alphaviral  RNA  polymerase  in  virus-infected  metazoan 
cells  (16,  54).  Physiological  N-end  rule  substrates  were  also 
identified  among  the  proteins  secreted  into  the  host  cell’s  cy¬ 
tosol  by  intracellular  parasites  such  as  the  bacterium  Listeria 
monocytogenes  (46).  Inhibition  of  the  N-end  rule  pathway  was 
reported  to  interfere  with  mammalian  cell  differentiation  (24) 
and  to  delay  limb  regeneration  in  amphibians  (52).  Microar- 
ray-based  comparisons  of  gene  expression  patterns  irv  wild-type 
and  congenic  ubrl^  strains  of  5.  cerevisiae  have  shown  that  a 
number  of  yeast  genes,  of  diverse  functions,  are  significantly 


up-  or  down-regulated  in  the  absence  of  the  N-end  rule  path¬ 
way  (39a). 

The  mammalian  counterpart  of  the  yeast  ATEl-encodcd 
R-transferase  was  partially  purified  from  rabbit  reticulocytes 
and  shown  to  cofractionate  with  Arg-tRNA  synthetase  (12). 
Recent  studies  of  the  Ub-dependent  proteolysis  of  endogenous 
proteins  in  muscle  extracts  suggested  that  the  N-end  rule  path¬ 
way  plays  a  major  role  in  catabolic  states  that  result  in  muscle 
atrophy  (48, 49).  A  significant  fraction  of  the  N-end  rule  path¬ 
way’s  activity  in  muscle  extracts  was  found  to  be  tRNA  depen¬ 
dent,  indicating  the  involvement  of  R-transferase  (48,  49).  It 
was  also  reported  that  a  crush  injury  to  the  rat  sciatic  nerve 
results  in  a  —10-fold  increase  in  the  rate  of  arginine  conjuga¬ 
tion  to  the  N  termini  of  unidentified  proteins  in  the  nerve’s 
region  upstream  of  the  crush  site  (15, 57),  suggesting  an  injury- 
induced  increase  in  the  concentration  of  R-transferase  sub¬ 
strates  and/or  an  enhanced  activity  of  the  N-end  rule  pathway. 

In  this  work,  we  began  the  functional  analysis  of  mammalian 
R-transferase  (ATE Ip)  by  isolating  mouse  cDNA  encoding 
this  enzyme.  Surprisingly,  we  found  two  Atel  cDNA  species, 
which  were  identical  except  for  a  129-bp  region  that  encoded 
similar  but  distinct  sequences.  One  or  the  other,  but  not  both, 
of  the  corresponding /l/e7  exons  is  retained  in  the  mature 
mRNA,  and  the  ratio  of  the  resulting  two  species, /I /e7- 7  and 
Atel-2,  varies  greatly  among  mouse  tissues.  We  also  show  that 
ATEl-lp  and  ATEl-2p,  while  differing  in  activity,  can  argin- 
ylate  N-terminal  Asp  and  Glu  in  model  substrates.  However, 
neither  of  them  can  arginylatc  N-terminal  Cys,  the  known 
secondary  destabilizing  residue  (18),  suggesting  the  existence 
of  a  distinct  Cys-specific  mammalian  R-transferase. 

MATERIALS  AND  METHODS 

Strains  and  plasmids.  The  S.  cerevisiae  strains  used  were  JD55  (A//17a  ura3-52 
his3-tS200  Ieu2-3,112  trpI-A63  Ivs2-S01  uhr}A::HIS3)  (34)  and  SGY3  {MATa 
ura3-52lys2-80}  ade2-Wl  //7>/>A63//A?-A2^i0/ea2-A/a/e/-A2::L£//2)  (19a),  Cells 
were  grown  in  rich  medium  (YPD)  or  in  synthetic  media  (SD)  containing  0.67% 
yeast  nitrogen  base  without  amino  acids  (Difeo).  auxotrophic  nutrients,  and  2% 
glucose.  To  induce  the  Pc:At.  promoter,  glucose  was  replaced  by  2%  galactose 
(SG  medium).  Transformation  of  5.  cerevisiae  was  performed  by  the  lithium 
acetate  method  (3). 

The  ithrJA  ate] A  double  mutant  AVY34  was  constructed  by  replacing  93%  of 
the  ATEl  open  reading  frame  (ORF)  (the  first  470  codons)  in  strain  JD55 
(uhrlA)  by  the  LEU2  gene,  through  homologous  recombination  (42)  with  the 
introduced  LEU2  gene  flanked  on  either  side  by  40  bp  of  >17£/-specific  se¬ 
quences.  Mutants  were  selected  on  SD  (lacking  Leu  and  His)  plates,  and  Leu"^ 
isolates  were  checked  by  PCR  for  the  absence  of  ATE  1  and  by  colony  assays  (7, 
8)  for  the  absence  of  Ate  Ip  and  Ubrlp  activity.  High-copy-number  pUB23-X 
plasmids  expressing  Ub-X-^gal  proteins  (see  below)  from  PaAi,  **1  cerevisiae 
have  been  described  elsewhere  (4).  Mouse  Are/-/  and  Are/ -2  cDNAs  (see  below) 
were  subcloned  into  the  low-copy-number  vector  p414GALl  (36),  using  the 
engineered  BamHl  (5')  and  A7foI  (3')  restriction  sites,  yielding  plasmids  pATl 
and  pAT2.  For  localization  assays  with  green  fluorescent  protein  (GFP),  cDNAs 
encoding  mouse  ATEl-lp  or  ATEl-2p  were  subcloned  into  the  pEGFP-Nl 
N-terminal  protein  fusion  vector  (Clontech,  Palo  Alto,  Calif.),  using  the  engi¬ 
neered  Y/ioI  (5')  andAgel  (3')  restriction  sites,  yielding  plasmids  pATl-GFP  and 
pAT2-GFP. 

Isolation  of  the  mouse  Atel-1  and  AteT2  cDNAs.  The  392-bp  fragment  of  the 
mouse  EST  (expressed  sequence  tag)  clone  (accession  no.  AA415294),  which  was 
identified  in  GenBank  through  species  walking  (see  Results),  was  used  as  a  probe 
to  screen  a  XgtlO-based  mouse  cDNA  library  from  MEL-C19  cells  (Clontech), 
using  standard  procedures  (3).  Eight  positive  clones,  whose  inserts  ranged  from 
0.5  to  1,6  kb,  were  analyzed  by  PCR  and  partial  sequencing.  The  cDNA  inserts 
of  clones  3  and  8  were  then  subcloned  into  pBluescript  II  SK*^  (29)  and  se¬ 
quenced  on  both  strands.  The  resulting  ORFs  were  identical  except  for  a  129-bp 
internal  region  (see  Results)  (Fig.  2A).  The  deduced  amino  acid  sequences  of  the 
mouse  cDNA  clones  3  and  8  were  weakly  but  significantly  similar  to  the  deduced 
sequences  of  Caenorhabditis  elegans  and  S.  cerevisiae  ATElp  (see  Results)  (Fig. 
2B)  and  corresponded  to  nucleotides  (nt)  699  to  1870  and  587  to  2099,  respec¬ 
tively,  in  the  subsequently  produced  full-length  mouse  Ate/-/  and  Ate  1-2  cDNAs 
(accession  no.  AF079096  and  AF079097). 

A  human  EST  clone  (accession  no.  AA503372)  whose  deduced  amino  acid 
sequence  was  highly  similar  to  that  of  the  partial  mouse  Atel  cDNA  done  8  was 
found  in  GenBank,  using  the  partial  mouse  ATElp  sequence  as  a  query.  This 
EST  clone  was  purchased  from  Genome  Systems  (St.  Louis,  Mo.)  and  sequenced 
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A  At  el  aAAAATCCAAAAGGAGGAAACACGAGAAACATAAGCATAACTCTCCATCTGGTGATGATAGTTCTGACCGAAGGCGGCGGTAGCGGCGGCTGCGTAGGACTGGGTGGCGCAGCGGCGGCG  120 

At  el  gGCTCGCGGACATGGCTTCTTGGAGCGCGCCTTCACCTAGCCTTGTAGAGTATTTTGAGGGCCAGACCTCCTTCCAG^TGGCTA(^C^G^C^GTTGGGCAGTCGCTCCTATGGCA  240 

Atel-1  tGTGGGCACATTCCATGACAGTGCAGGATTATCAGGATCTTATAGACCGAGGATGGAGAAGAAGTGGG^TATGTGTAC^CCTGTCATGGATCAAAC;^<^TCCTCAGTATACAA  360 

At  ei  -1  T^TOTGTCATCCTTTACAGTTTCAGCCATCAAAATCTCACAAGAAAGTTTTCAAAAAAATGCTGAAATTTCTGGCTAAAGGAGAGATCTCGAAAGGCAATTGTGAGGATGAGCCCATGG  480 

R  C  HPLQFQPSKSHKKVLKKMLKFLAKGEISKGNCEDEPMD  117 

Atel  *1  aTTCTACAGTGGAGGATGCTGTTGACGGTGACTTTGCATTAATTAACAAGCTGGATATAAAGTGTGATCTCAAAACACTCAGTGACCTCAAAGGAAGCATAGAGAGTGAAGAGAAGGAGA  600 

NrKLjr)X  I^C  S  X  HS  C£}CE1C  15v 

Atli-1  AAGAAAAGAGTATAAAGAAAGAAGTOTCT^G^TTCATTCATCCACAATCTATAGAGGAGAAGTTGGGCTCTGGTGAACCATCACATCCAATCAAAGTTCATATTGGTCCTAAGCCAG  720 
EKSIKKEGSKEFIHPQSIEEKLGSGEPSHPIKVHIGPKPG  197 

Atel-1  GCAAAGGGGCTGACTTGAGTAAGCCTCCATGTCGGAAAGCAAGGGAAATGAGGAAAGAAAGGCAAAGATTAAAACGGATGCAGCAGGCCTCAGCTGCAGCCTCGGAGGCTCAAGGTCAGC  840 
KGADLSKPPCRKAREMRKERQRLKRMQQASAAASEAQGQP  237 

QQlQpaS^ 

Rtel-2 _ GTGAGGCTAGTACCTGCCTCCT 

Atel-1  CAGTCTGTTTGTTACCAAAGGCTAAATCCAACCAGCCCAAGTCACTGOAAGATTTGATTTTTCAATCTTTACCAGAAAATGCATCGCACAAGTTAGAGGTGAGGGTGGTAAGATCTTCTC  960 
VCLLPKAKSNQPKSLEDLIFQSLPENASHKIiEBQvHRSSP  277 

BDQsSHS  SBlI  QSr  SLQTKQQVAXElQXAiaB  z  cbkSb 

Atel-2  ttgaggacccagagttcaactcatccttcaaccagtccttttctttatataccaagtatcaagtggctatacaccaggaagcacctgaaatatgtgaaaagtctgag - 

Atel-1  CACCAAGTCCTCAGTTCAGAGCCACATTTCAGGAGTCTTACCAGGTCTATAAACGCTACCAG  ATGGTTGTTCACAAGGACCCGCCTGATAAGCCAACCGTGAGCCAGTTTACAAGATTCC  1080 

f  qvQkrQQmvvEIk  dpQpk  p  tvQq  a  q  g  q  p  317 


Atel-2 

Atel-1 

Atel-2 

Atel-1 

Atel-2 

Atel-1 
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Atel-1 
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Atel-1 

Atel-2 

Atel-1 


TTTGCAGCTCACCATTGGAGGCAGAGCACCCTGCTGATGGACCAGAATGTGGTTATGGCTCCTTTCACCAGCAGTACTGGCTCGATGGGAAGATCATTGCTGTGGGGGTGTTAGACATTC  1200 
C  S  SPLEAEHPADGPECGYGSFHQQYWLDGKI1AVGVLD_IL357 

TCCCGTACTGTGTGTTTTCTGTGTATCTCTACTACGATCCTGATTATTCATTTCTGTCTTTGGGTGTCTATTCAGCATTAAGAGAAATTGCTTTTACTAGACAACTGCATGAGAAAACAT  1320 
c  VFSVYLYYDPDYSFLSLGVYSALREIAFTRQLHEKTS397 


P  Y 


CGCAACTCAGCTATTATTATATGGGTTTCTACATTCATTCCTGTCCCAAGATGAGATACAAGGGTCAATATAGACCTTCTGATTTGCTGTGTCCTGAGACGTATGTCTGGGTGCCCATTG  1440 
QLSYYYMGPYIHSCPKMRYKGQYRPSDLLCPETYVWV^P  _ E  437 

AGCAGTGCCTGCCTTCTCTGGACAACTCCAAGTACTGCCGTTTCAACCAGGACCCCGAAGCAGAGGATGAAGGACGCAGTAAGGAACTTGACCGACTAAGGGTGTTTCACAGACGATCTG  1560 
“  SLDNSKY  CRFNQDPEAEDEGRSKELDRLRVFHRRSA  477 


Q  C  L 


CCATGCCTTACGGTGTTTATAAGAATCACCAAGAAGACCCAAGTGAGGAGGCTGOTGTGCTGGAGTATGCAAACCTCGTAGGACAGAAGTGCTCGGAGAGGATGCTGCTGTTCAGACACT  1680 
MPYGVYKNHQEDPSEEAGVLEYANLVGQKCSERMLLFRH*  516 

GAAGGCCTGCACCTCAGCCCAGCGCCAGCCTGGCATCTCTTGCCACAGTCCCACCTGCTTCTCGGCTCCTGCAGCCATTCACCAAGTACATTCGTAGAGCTTTTTAAAAATATTTTGTGG  1800 
AATTGTATTTATGAGAATCTTGTGGTTAATATAAAGATTTTAAAATATTCTATGACCTAGCCCTTTAATTGGGACTTGTCCTTTGCAGCTTGCTTCTGGAGCAAGGTGTCCTTCATGTCT  1920 
CCAGTGAATATTTAGGGAAGGAAAATGAAGAAGTCAATATAAGTCATTTTGTCACAGTTACTGTTCTGTGCTGGAGAAATTCTGACACAGAGACTGTTCTTTTAGACCAATGGTTCTTAA  2040 
CCTGTGGGCTCACCACACTCACAGGCCGAATGTTAGATAGCCTGTGTGAGATCAACTCA  2099 
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lo  obtain  more  of  the  5'-proxinial  humixn  ATEl  sequence.  Reverse  transcription- 
PCR  (RT-PCR)  (3)  was  then  carried  out  with  poly(A)^  RNA  from  mouse 
embryonic  fibroblasts,  using  the  mouse  /l/e/-specific  reverse  primer  5'-CCTTT 
GGTAACAAACAGACTGGCTG-3’  and  the  forward  primer  5'-TCTCATAG 
ACCGAGGATGGCGAAG-3',  whose  sequence  was  derived  from  the  above 
human  EST  clone.  The  resulting  PCR  products,  which  appeared  as  a  smear  upon 
agarose  ge!  electrophoresis,  were  ligated  into  the  TA  cloning  vector  (Invitrogen, 
San  Dieco,  Calif.),  and  the  ligation  mixture  was  used  as  a  template  for  PCR  using 
a  nested  mouse  Ale  I  primer  5’-CTGCAGCTGAGGCCTGCTGCATCCG'3 
and  a  vector-specific  primer  5’-GTnTCCCAGTCACGAC-3'.  This  strategy 
yielded  a  single  major  DNA  species  (data  not  shown).  We  then  applied  .^  '-RACE 
(rapid  amplification  of  5'  cDNA  ends)  (3),  using  the  above  RT-PCR-derivcd 
sequence,  to  produce  the  full-length  cDNA  as  previously  described  (19). 

Analysis  of  the  mouse  Atel  gene.  The  mouse  genomic  DNA  from  L  cells  was 
used  as  a  template  for  PCR,  using  the  Expand  high-fidelity  PCR  system  (Bochr- 
inger,  Indianapolis.  Ind.)  and  exon-specific  primers  as  previously  described  (29), 
to  produce  DNA  fracments  that  together  spanned  ~4  kb  of  the  mouse  Alel  gene 
and  contained  the  two  alternative  \29-hp  Ate]  exons.  The  regions  encompassing 
exon/intron  junctions  were  sequenced  by  using  exon-  and  intron-specific  primers. 
Thereafter,  a  strateg>'  described  earlier  for  the  Uhrl  gene  (29)  was  used  to  screen, 
using  a  fragment  of  the  mouse  Aiel  cDNA  (nt  255-1139),  a  BAC  (bacterial 
artificial  chromosome )-based  library-  of  mouse  genomic  DNA  fragments  from 
strain  129SvJ  (Genome  Sy.stems).  yielding  one  BAC  clone  containing  the  mouse 

Atel  gene.  .... 

Isolation  of  the  human,  plant,  and  fly  ATE  1  cDNAs.  Using  the  cloned  mouse 
ATE  Ip  sequences  (see  above)  as  queries,  we  identified  in  Gen  Bank  several 
significantly  similar  EST  sequences  from  other  organisms  (data  not  shown).  To 
determine  whether  these  species  also  contained  the  two  forms  oi ATE  1  mRNA, 
we  isolated  the  corresponding cDNAs.  RT-PCR  (3)  with  poly(A)^  RNA 
from  human  293  cells  and  the  primers  5'-CAATGGCATGTGGGCACATTCC 
ATG-3'  (specific  for  the  human  EST  clone  AA503372  jsec  above])  and  5'-CC 
ACAGGTACn'GAATATGTATCCTG-3'  (specific  for  the  human  EST  cUme 
AA 195361)  was  carried  out,  yielding  a  1.6-kb  human  ATE!  cDNA  fragment 
lacking  the  first  41  codons  of  the. 4 ORF.  This  fragment  (a  mixture  of  the  two 
alternative  cDNAs)  was  subcloned  into  the  TA  vector  (Invitrogen)  and  se¬ 
quenced  on  both  strands.  Full-length cDNAs  hom  ArabUiopsis  tltaliana  and 
Drosophila  melanogasterwere  isolated  by  RT-PCR  as  well,  using  total  RNA  from 
A.  thaliana  leaves,  poly  (A)"  RNA  from  D.  melanogasier  embryos,  and  primers 
.specific  for  the  5'  and  3'  ends  of  the  corresponding  ORFs.  By  using  the  strategy 
described  above  for  the  human  ATE  I  cDNAs,  the  sequences  of  these  primers 
were  derived  from  the  EST  clones  that  were  initially  identified  in  GenBank  through 
their  similarity  to  the  mouse  ATElp  sequence,  then  purchased  from  Genome  Sys¬ 
tems,  and  sequenced  prior  to  RT-PCR  with  the  corresponding  RNA  preparations. 
The  final  human,  plant,  and  fiy  ATEl  cDNAs  were  sequenced  on  both  strands. 

Assays  of  p-gal.  Colony  assays  for  the  Escherichia  coli  p-galactosidase  (Pgal) 
in  5.  cerevisiae  were  carried  out  by  overlaying  yeast  colonies  on  SG  plates  with 
0.5%  agarose  containing  0.1%  sodium  dodecyl  sulfate  (SDS),  4%  dimcthylfor- 
mamide,  and  a  0.1 -mg/ml  solution  of  the  chromogenic  pgal  substrate  X-Gal 
(5-bromo-4-chloro-3-indolvl-p-D-galactopyranoside;  Calbiochem,  La  Jolla,  Cal¬ 
if.),  followed  by  incubation  for  1  to  2  h  at  3rC.  Quantitative  assays  for  pgal  in 
5.  cerevisiae  were  carried  out  with  whole-cell  extracts,  using  another  chromogenic 
Pgal  substrate,  o-nitrophenyl-p-D-galactopyranoside  (ONPG).  Cells  in  a  5-ml 
culture  of  ~1)  were  pelleted  bv  centrifugation  and  resuspended  in  5  ml  of 
buffer  Z  (60  mM  Na^HPOa.  40  mM  NaH2P04,  10  mM  KCl,  1  mM  MgS04,  50 
mM  p-mercaptoethanol  [pH  7.0]).  After  theA^oo  of  the  suspension  was  deter¬ 
mined  50-  or  100-til  samples  were  diluted  to  1  ml  with  buffer  Z;  0.1%  SDS  (20 
al)  and  CHCI3  (50  ^Il)  were  then  added;  the  suspension  was  vortexed  for  10  to 
15  s  and  incubated  for  15  min  at  30®C,  followed  by  the  addition  of  200  p.1  of 
ONPG  (4  mg/ml  in  buffer  Z)  and  further  incubation  at  30®C,  until  a  medium 
yellow  color  had  developed,  at  which  point  the  reaction  was  stopped  by  the 
addition  of  1  M  NaXO^  (0.4  ml).  The  mixture  was  centrifuged  for  5  min  at 
1.100  X  g,  and  theyf/^o  and^l^wiof  the  samples  were  measured.  The  ONPG  units 


(UoNPci)  activity  were  calculated  as  follows:  Uonpc.  ^  ^'000  x  |(.-U2(»)  “ 

(1.75  x' /1 5, „,)]//  X  \*  X  where  /  and  v  were,  respectively,  the  time  of 

incubation  (minutes)  and  the  sample  volume  (milliliters)  (3).^ 

Purification  and  N-terminal  sequencing  of  X-pgal  proteins.  Extracts  were 
prepared  (using  the  liquid  nitrogen  procedure  {3])  from  S.  cerevisiae  AVY34 
{tihrJS  atel\)  cotransformed  with  a  pUB23-X  plasmid  (4)  (expressing  Ub-X- 
pgal)  and  cither  pATl  (expressing  mouse  ATEl-lp)  or  pAT2  (expressing  mouse 
a"tE1-2p).  Cultures  were  grown  in  SG  to  an/l^,,H)  of  ~  1.  Specific  X-pgal  proteins 
(X  =  Asp  or  Cys)  were  purified  by  affinity  chromatography  on  ProioSorb  laeZ 
(Promega,  Madison.  Wis,).  a  monoclonal  anti-pga!  antibody  coupled  to  agarose 
beads  (Promega).  X-pgal  proteins  were  further  purified  by  electrophoresis  on 
SDS-7%  polyacix'lamide  gels  and  were  elcctroblotted  onto  Immobilon-P' 
membranes  (Millipore,  Bedford,  Mass.).  N-tcrminal  sequencing  of  10  to  15  pmol 
of  elcctroblotted  X-Pgal  was  carried  out  for  at  Ica.st  five  cycles,  using  an  Applied 
Biosvstcms  476A  protein  .sequencer  (Caltech  Microchemistrv'  Facility). 

Mouse  cell  cultures,  transfection,  and  GFP  localization.  NIH  3T.3  cells  (ATCC 
165S-CRL)  were  crown  as  monolayers  in  Dulbccco's  modified  Eagle  medium 
(GIBCO.  Frederick.  Md.)  supplemented  with  10%  fetal  bovine  scrum.  Cells  for 
GFP  localization  analyses  were  grown  to  —15%  confluence  on  glass  coverslips 
for  24  h  prior  to  transfection  with  either  pATl-GFP  or  pAT2-GFP,  using  Lipo- 
fcctamine  (GIBCO)  and  the  manufacturer-supplied  protocol.  Cells  were  incu¬ 
bated  for  5  h  at  37‘’C  in  serum-free  medium  containing  DNA  and  Lipofectamine. 
Thereafter  an  equal  volume  of  medium  containing  20%  scrum  wiis  added,  and 
the  cells  were  grown  for  another  12  to  20  h  at  37®C.  Cells  were  fixed  with  2% 
formaldehyde  in  phosphate-buffered  saline,  and  GFP  fluorescence  was  visualized 
in  a  Zeiss  Axiophot  microscope. 

Northern  hybridization.  Mouse  multiple-tissue  Northern  blots  containing  2  p.g 
of  poly(A)"  RNA  per  lane  (Clontcch)  were  probed  with  the  "'-P-labeled  1.1-kb 
mouse  ,4/c'y  cDNA  (nt  6.38  to  ,1734),  using  the  manufacturer-supplied  protocol. 

Determination  of  the  relative  levels  oiAlel-l  and  Atel-2  mRNAs.  Samples  of 
total  RNA  isolated  as  described  previously  (3)  from  mouse  spleen,  skeletal 
muscle,  liver,  brain,  testis,  and  embryonic  fibroblasts  were  subjected  to  RT-PCR 
(28  cycles).  The  primers  5'-CAGTGGAGGATGCTGTTGACGGTGAC-3  and 
5'-GtGCT(7rGCCTCCAATGGTGAGCTG-3'  were  specific  for  the  identical 
regions  of  Atel-J  and  Atel-2  cDNAs  that  flanked  the  two  129-bp  exons  (sec 
Results)  which  distinguished  these  cDNAs.  The  re.sulting  624-bp  product  (a 
mixture  of  ihc  Ate  ]-l  and  Ate  1-2  cDNA  fragments)  was  treated  with  .SVrFI.  which 
cuts  at  different  sites  within  the  two  129-bp  exons,  folUwed  by  a  2%^  agarose  gel 
electrophoresis.  This  procedure  made  it  possible  to  distinguish  the  Atel-1  and 
Atel-2  fragments.  The  ratios  of  the  two  forms  of  Atel  cDNA  were  determined  by 
serial  dilutions  of  the  samples  prior  to  gel  electrophoresis. 

Nucleotide  sequence  accession  numbers.  The  nucleotide  sequences  reported 
in  this  paper  were  submitted  to  the  GenBank/EMBL  data  bank  and  assigned 
accession  no.  AF079096  (mouse /l/c7-/  cDNA),  AF()79097  (mouse  v4/c/.2  cDNA), 
AF079098  (human  Atel-1  cDNA),  AF079099  (human /lT£:7-2  cDNA),  AF0791(K) 
{A.  thaliana  Atel  cDNA),  and  AF0791()1  {D.  melanogaster  Atel  cDNA). 


RESULTS 

Identification  of  mouse  cDNAs  by  species  walking.  On 
the  assumption  that  the  sequences  of  R-transferases  in  differ¬ 
ent  species  might  be  sufficiently  conserved  to  be  detected  by 
using  the  sequence  of  the  only  cloned  R-transferase,  5.  cerevi- 
siae  Atelp  (7),  we  have  been  searching  GenBank  and  related 
databases.  No  mammalian  sequences  in  GenBank,  including 
the  EST  sequences,  had  significant  similarities  to  5.  cerevisiae 
Atelp.  However,  we  did  identify  a  nematode  (C.  elegans)  ORF 
(accession  no.  Z21146)  that  exhibited  similarity  to  yeast  Atelp 
(Fig.  2B)  and  then  used  the  C  elegans  sequence  to  identify  a 


FIG  2  Two  forms  of  the  mouse  Atel  cDNA  and  the  ATE  protein  family.  (A)  The  mouse  Atel-1  and  Atel-2  cDNAs  and  their  products  The 
oiAte]-'>  identical  to  those  of  Atel-1  (everywhere  except  for  the  129-bp  region)  are  indicated  by  dashes.  In  the  region  of  the  alternative  U9-bp  exons 
Atel  2  white-on-black  and  gray  shadings  highlight,  respectively,  identical  and  similar  residues.  The  circled  Cys  residues  are  homologous  to  those  that  arc  important 
for  the  enzvmatic  activity  of\  cerexisiae  Atelp  (32),  (B)  The  ATE  protein  family  and  the  origins  of  the  alternative  129-bp  exons.  AJipment  of  the  sequences  of  mouse 
ATEl  lo  mrn-Atel-l)  A  thaliana  Atelp  (At-Atel),  D.  melanogaster  Atelp  (Dm-Atel),  C.  elegans  Atelp  (Ce-Atel),  and  5.  cerevisiae  Atelp  (Sc-Ajel)  (accession  no. 
S  SimL^lsM  as  follows:  M,  L,  ifand  V;  D,  E,  N,  and  Q;  R,  K,  and  H;  Y,T,  and  W;  S,  A,  and  T.  The  region  encoded  by  the  a  ternative 

1 79  bn  exons  of  mouse  Atel  is  highlighted  by  a  thick  line.  Of  the  Cys  residues  that  are  conserved  among  all  ATE  proteins,  the  ones  required  and  not  requii^ed  for  the 
nSarac^  indicated,  respectively,  by  ▼  and  V.  The  N-temtinally  truncated  mouse  ATEl-lp  and  ATEl-2p  Protems  wh.ch  began 

nf  Met  47  (m\  lacked  the  R-iransferase  activity  (data  not  shown).  The  highly  variable  C-terminal  regions  of  ATE  proteins  were  omitted  from  the  alignment.  The 
sea™  l!)e  a|ned  using  TO  Computer  Group,  Madison,  Wis,),  Gaps  (-)  were  introduced  to  opt.mtze  the  alignment 

The  residue  numbers  are  on  the  right  of  the  sequences.  The  sequence  of  C,  elegans  Atelp  appears  to  lack  the  N-terminal  region  of  other  ATE  proteins  because  of  an 
mor  ifdefining  the  A, el  ORF  in  the  genomic  DNA  sequence  (accession  no.  Z21146).  (C)  Alignment  of  the  43.residue  regions  that  are 
17Q  hr,  emns  in  mammalian /fief  Sequences  shown:  mouse  (Mm-Atel-1  and  Mm-Atel-2),  human  (Hs-Atel-1  and  Hs-Atel-2),  D.  melanogaster  {Dm-Atei).C. 
(Ce-Atel)  A  thaliana  (At-Atel)!  5.  %mbe  (Sp-Atel;  accession  no.  Z99568),  and  S.  cerevisiae  (Sc-Atel)  (accession  no.  J05404).  T^e  degrees  of  identity  and  snnilanty 
of  ATE  proteim  acid  sequences  of  the  M8  (mouse  ATEl-lp)  or  M3  (mouse  ATEl-2p)  exon  of  the  mouse  .4, cf  gene  are  indicated  on  the  right. 

The  residues  conserved  among  all  of  the  compared  sequences  are  indicated  above  the  alignment. 
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FIG.  3.  Expression  of  Atel-1  and  AteI-2  mRNAs  in  different  mouse  tissues 
and  mouse  embryonic  fibroblasts  (MEF).  Two  forms  of  Atel  cDNA  (Fig.  2A) 
were  amplified  in  a  single  reaction  by  RT-PCR.  using  the  same  primers,  to  yield 
624-bp  fragments  that  included  the  region  of  the  alternative  129-bp  exons  (see 
Materials  and  Methods).  The  624-bp  fragments  were  digested  with  ^crFI,  which 
produced  a  231-bp  fragment  (a  mixture  of  Atel-l  and  Atel-2),  a  284-bp,  Atel- 
/-specific  fragment,  and  a  3 17-bp, /I re /-2-specific  fragment.  The  untreated  (lanes 
a)  and  .ScrFI-treated  (lanes  b)  samples  from  different  mouse  tissues  were  ana¬ 
lyzed  by  electrophoresis  in  a  2%  agarose  gel.  The  ratio  of  the  two  forms  of  Atel 
mRNA,  defined  as  the  ratio  of  the  284-bp  (Ate I -I)  fragment  to  the  317-bp 
(Ate I -2)  fragment,  was  determined  by  analyzing  serially  diluted  samples  and 
comparing  the  resulting  band  intensities  (data  not  shown),  sk.,  skeletal. 


significantly  similar  EST  sequence  of  D.  melanogaster  cDNA 
(accession  no.  AA391570).  Finally,  using  the  deduced  amino 
acid  sequence  of  the  Drosophila  EST  clone  as  a  probe,  we 
identified  a  467-bp  mouse  EST  sequence  (accession  no. 
AA4 15294)  that  exhibited  weak  but  significant  similarity  to  the 
Drosophila  sequence  but  no  detectable  similarity  to  S.  cerevi- 
siae  Ate  Ip.  On  a  chance  that  this  467-bp  EST  had  been  derived 
from  the  mouse  Atel  cDNA,  we  used  it  to  screen  a  mouse 
cDNA  library  and  indeed  isolated  the  putative  mouse  Atel 
cDNAs  (Fig,  2A). 

Alternative  splicing  results  in  t^o  species  of  mouse  Atel 
cDNA  containing  distinct  but  homologous  exons.  During  the 
initial  mouse  cDNA  library  screening,  we  found  that  the  cDNA 
clone  3  (nt  699  to  1870  ofAteDl  cDNA)  was  identical  to  the 
cDNA  clone  8  (nt  587  to  2099  of  Atel-1  cDNA),  except  for  a 
129-bp  region  whose  deduced  amino  acid  sequences  were  sim¬ 
ilar  (31%  identity;  61%  similarity)  (Fig,  2).  The  two  full-length 
Atel  cDNAs  (termed  y4/c/-/  tind  Ate  1-2),  which  were  obtained 
by  RT-PCR  followed  by  5'-RACE  (see  Materials  and  Meth¬ 
ods),  encoded  proteins  of  identical  length,  516  residues  (59.2 
kDa  and  pi  of  8.14  versus  59.1  kDa  and  pi  of  7.22)  (Fig.  2A), 
that  contained  regions  of  similarity  to  the  57.8-kDa  Ate  Ip  of  5. 
cerevisiae  (Fig.  2B).  RT-PCR  (followed  by  subcloning)  with 
RNAs  from  different  mouse  tissues  also  produced  the  two 
forms  of  Atel  cDNAs,  indicating  that  the  two  species  were  in 
fact  present  in  the  initial  RNA  preparation  (Fig.  3  and  data  not 
shown). 

To  determine  whether  both  of  the  two  129-bp  regions  of  the 
Atel-1  and  Atel -2  cDNAs  were  a  part  of  the  Atel  gene,  and 
whether  tind  Atel-2  were  produced  through  alternative 

splicing,  we  analyzed  the  mouse  Atel  gene  in  the  vicinity  of  its 
two  129-bp  exons,  using  at  first  PCR  and  subsequently  a  BAC 
clone  containing  Atel  (see  Materials  and  Methods).  The  two 
129-bp  exons  were  located  next  to  each  other  in  the  Atel  gene 
(Fig.  4A).  We  also  found  that  the  12-bp  sequences  around  the 
splice  acceptor  sites  of  these  exons  (6  bp  in  the  intron  and  6  bp 
in  the  exon)  were  identical  between  the  two  exons  (Fig.  4B), 
consistent  with  the  alternative  presence  of  these  exons  in  the 
mature  Atel  mRNA.  The  exon-containing  RT-PCR  products 
from  different  mouse  tissues  appeared  as  a  single  major  band 
retaining  one  of  the  two  129-bp  e.xons  (Fig.  3  and  data  not 
shown).  Subcloning  and  analyses  of  these  RT-PCR  products 
yielded  no  other  differentially  spliced  Atel  cDNAs  (for  exam¬ 
ple,  cDNAs  retaining  both  or  neither  of  the  two  129-bp  exons). 
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FIG.  4.  The  two  forms  of  mouse  At\il  mRNA  are  produced  by  alternative 
splicing.  (A)  The  two  alternative  129-bp  exons  are  adjacent  in  the  mouse  Atel 
gene.  The  thick  line  denotes  genomic  DNA;  the  striped  and  black  rectangles 
denote  the  alternative  129-bp  exons,  M8  and  M3  (see  Materials  and  Methods): 
gray  rectangles  denote  the  flanking  Atel  exons,  of  unknown  sizes:  thin  lines 
denote  the  alternative  splicing  patterns  that  yield  the  two  forms  of  Atel  mRNA. 
(B)  The  underlined  12  bp  (6  bp  in  the  intron  and  6  bp  in  the  exon)  around  the 
splice  acceptor  sites  are  identical  between  the  two  alternative  129-bp  exons.  (C) 
Scale  diagrams  of  the  two  forms  of  mouse  Atel  cDNAs.  The  alternative  129-bp 
exons  M8  and  M3  are  indicated  by  the  striped  and  black  boxes,  respectively. 


suggesting  that  the  splicing  of  Atel  pre-mRNA  is  tightly  regu¬ 
lated  to  retain  one  and  only  one  of  the  two  alternative  exons. 
Thus,  the  two  forms  of  Atel  mRNAs  are  produced  by  a  nearly 
unprecedented  (see  Discussion)  splicing  pathway  which  ulti¬ 
mately  yields  two  proteins  of  identical  size  that  bear  two  alter¬ 
native,  homologous  but  distinct  43-residue  internal  sequences 
(Fig.  4C). 

The  absence  of  alternative  129-bp  exons  from  the  plant  and 
^yAtel  genes.  To  explore  the  evolution  of  Atel,  and  especially 
the  phylogeny  of  its  alternative  129-bp  exons,  we  cloned  the 
human,  plant  {A.  thaliana),  and  fly  (D.  melanogaster)  ATEl 
cDNAs  (see  Materials  and  Methods).  Two  forms  of  the  human 
ATEl  cDNA,  termed  Hs-ATEl-1  and  Hs-ATEl-2,  v^exe  iso¬ 
lated  from  human  293  cells  (the  forms’  molar  ratio  was  about 
1).  However,  only  one  form  of  the  Atel  cDNA  was  isolated 
from  either  the  leaves  of  ^4.  thaliana  (termed  At-Atel)  or  D. 
melanogaster  embryos  (termed  Dm-Atel),  suggesting  that  the 
alternative  129-bp  exons  may  not  be  present  in  the  Atel  genes 
of  plants  and  arthropods.  The  A,  thaliana  and  D.  melanogaster 
Ate  Ip  proteins  were,  respectively,  629  and  477  residues  long 
(71  and  55  kDa,  with  pis  of  6.0  and  8.4).  Mouse  ATEl-lp  was 
82,  38,  and  42%  identical  (as  well  as  91,  57,  and  61%  similar) 
to  human  ATEl-lp,  A.  thaliana  Ate  Ip,  and  D,  melanogaster 
Ate  Ip,  respectively  (Fig.  2  and  data  not  shown).  A.  thaliana 
Atelp  bore  a  16-residue  region  containing  exclusively  Asp  or 
Glu  (data  not  shown). 

We  used  RT-PCR  and  RNA  preparations  from  A.  thaliana 
and  D.  melanogaster  to  amplify  the  relevant  regions  of  the 
corresponding  Atel  cDNAs.  The  resulting  fragments  were  di¬ 
gested  with  restriction  enzymes  that  recognize,  in  each  species, 
exclusively  the  region  that  corresponds  to  the  129-bp  exons  of 
the  mouse  Atel  cDNAs,  and  the  products  were  analyzed  by  gel 
electrophoresis.  The  initial  cDNA  fragments  of^.  thaliana  and 
D.  melanogaster  Atel  completely  disappeared  after  this  treat¬ 
ment,  in  contrast  to  the  homologous  mouse  cDNA  fragment 
(which  contained  two  distinct  sequences  of  identical  length), 
suggesting  that  the  two  alternative  exons  were  absent  from  the 
Atel  genes  of  plants  and  arthropods  (Fig.  5A). 

While  this  analysis  was  under  way,  complete  sequences  of 
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FIG.  5.  Alternative  splicing  of  Atel  pre-mRNA  in  mammals  (the  mouse)  but  in  neither  plants  {A.  thaiiana)  nor  arthropods  (D.  melanogaster).  (A)  The  relevant/4re/ 
cDNA  fragments  from  mouse  (794  bp).  A.  thaliana  (626  bp),  and  D.  melanogaster  (856  bp)  were  produced  by  RT-PCR  (see  Materials  and  Methods).  The  products  were 
treated  with  the  indicated  restriction  endonucleases  that  cut  exclusively  within  the  two  alternative  129-bp  exons  of  mouse /l/e7  cDNAs  (RjpMI  for  Atel-2;  5exAI  for 
Atel’l)  or  within  the  corresponding  regions  of  A.  thaliana  {Pvull  and  5crFI)  and  D.  melanogaster  {Accl  and  £coRV)  AteJ  cDNAs.  (B)  The  A.  thaliana  and  D. 
melanogaster  AteJ  genes.  The  exon-intron  organization  of  these  genes  was  deduced  through  comparisons  of  their  cDNA  sequences,  determined  in  this  work  (see 
Materials  and  Methods),  with  the  concurrently  determined  sequences  of  the  corresponding  genomic  DNA  regions  (see  text).  The  horizontal  lines  and  rectangles  denote, 
respectively,  introns  and  exons,  whose  lengths  are  indicated  below  and  above  the  line  denoting  introns.  Thick  horizontal  lines  indicate  the  regions  of  thaliana  and 
D.  melanogaster  cDNAs  that  correspond  to  the  alternative  l29-bp  exons  of  the  mouse  and  human  AteJ  cDNAs  (Fig.  2  and  4).  The  lengths  of  the  A.  thaliana  and  D. 
melanogaster  AteJ  genes  are,  respectively,  -3  and  ~2.5  kb. 


the  A.  thaliana  and  D.  melanogaster  Atel  loci,  determined 
through  the  corresponding  sequencing  projects,  were  depos¬ 
ited  in  GenBank  (accession  no.  AA005237  and  accession  no. 
AC004321,  respectively).  By  comparing  the  cloned  cDNAs 
(see  Materials  and  Methods)  and  the  corresponding  genomic 
sequences  of/4,  thaliana  and  D.  melanogaster,  we  could  deduce 
the  organization  of  these  Atel  genes.  The  results  (Fig,  5B) 
directly  confirmed  the  absence  of  the  alternative  homologous 
exons  from/4/6"/  of /I.  thaliana  and  D.  melanogaster,  in  contrast 
to  mammalian /4/^/.  The  corresponding  region  of  plant  Atelp 
is  more  similar  to  the  exon-encoded  sequence  of  mouse  ATEl- 
Ip,  whereas  in  Drosophila  this  region  is  more  similar  to  the 
alternative  sequence  of  ATEl-2p  (Fig.  2C).  The  corresponding 
regions  of  5.  cerevisiae  and  Schizosaccharomyces  pombe  Atelp 
are  not  preferentially  similar  to  either  of  the  two  alternative 
exon-encoding  sequences  of  mouse  ATE  Ip  (Fig.  2C). 

Mouse  ATEl-lp  and  ATEl-2p  can  implement  the  Asp/Glu- 
specific  subset  of  the  N-end  rule  pathway  but  differ  in  activity. 
To  determine  whether  the  two  putative  mouse  R-transferases 
arc  in  fact  R-transferases  and  to  compare  their  activities  in  an 
in  vivo  setting,  we  examined  whether  ATEl-lp  and  ATEl-2p 
could  confer  metabolic  instability  on  Asp-Pgal  and  Glu-Pgal  in 
fl/c/A  S,  cerevisiae.  Asp  and  GIu  are  secondary  destabilizing 
residues  in  the  N-end  rule  (Fig.  1  and  introduction).  The  test 
substrates  Asp-Pgal  and  Glu-Pgal  (produced  through  cotrans- 
lational  deubiquitylation  of  Ub-Asp-Pgal  and  Ub-Glu-pgal 
(4))  are  short-lived  in  wild-type  yeast  (half-lives  of  ~3  and  ~30 
min,  respectively)  but  long-lived  (half-life  of  >20  h)  in  atel^  S, 
cerevisiae  that  lacks  the  ^TE/ -encoded  yeast  R-transferase  (5, 
7).  Previous  work  (19, 33)  has  shown  that  the  steady-state  level 
of  an  X-Pgal  protein  is  a  sensitive  measure  of  its  metabolic 
stability. 

S.  cerevisiae  ate  11  cells  were  cotransformed  with  a  pair  of 


plasmids  that  expressed  one  of  the  two  putative  mouse  R- 
transferases,  ATEl-lp  or  ATEl-2p,  and  one  of  several  test 
substrates  (as  the  corresponding  Ub  fusions):  Asp-Pgal,  Glu- 
pgal,  Arg-pgal,  (2ys-Pgal,  or  Met-Pgal.  Met  and  Cys  are  stabi¬ 
lizing  residues  in  the  yeast  N-end  rule;  Arg  is  a  primary  desta¬ 
bilizing  residue;  Asp  and  Glu  are  secondary  destabilizing 
residues  (5,  56).  Control  tests  included  either  the  vector  alone 
or  a  plasmid  expressing  S.  cerevisiae  Atelp.  The  steady-state 
levels  of  X-Pgal  proteins  were  determined  by  measuring  the 
enzymatic  activity  of  Pgal  in  yeast  extracts.  Using  this  assay,  we 
found  that  both  forms  of  mouse  ATElp  were  able  to  confer 
metabolic  instability  on  either  Asp-pgal  or  Glu-Pgal  in  ate  11 S. 
cerevisiae  (Fig.  6A).  ATEl-lp  and  ATEl-2p  destabilized  Glu- 
Pgal  much  less  than  Asp-Pgal  (Fig.  6A),  consistent  with  Glu 
being  a  less  destabilizing  residue  in  the  N-end  rule  than  Asp, 
presumably  because  of  less  efficient  arginylation  of  the  N- 
terminal  Glu  by  R-transferases  (54).  However,  while  the  ap¬ 
parent  destabilizing  activity  of  the  mouse  ATEl-lp  R-trans- 
ferase  was  only  slightly  lower  than  that  of  S.  cerevisiae  Atelp 
(expressed  from  the  identical  vector  and  promoter),  the  activ¬ 
ity  of  mouse  ATEl-2p  was  significantly  lower  than  that  of 
ATEl-lp  (Fig.  6A). 

We  also  asked  whether  the  two  forms  of  mouse  ATE  Ip 
could  influence  each  other's  activity  if  they  were  coexpressed  in 
the  same  cell  (such  an  influence  might  be  expected,  for  in¬ 
stance,  if  the  active  form  of  R-transferase  were  a  dimer  or  if 
the  two  forms  of  R-transferase  competed  for  binding  to  the 
same  component  of  a  pathway).  5.  cerevisiae  ate  11  cells  were 
cotransformed  with  two  plasmids  bearing  different  selectable 
markers  and  expressing  different  combinations  of  ATEl-lp 
and  ATEl-2p  (1  +  1,  2+2,  or  1+2),  and  also  with  a  plasmid 
expressing  one  of  the  X-Pgal  test  proteins  (X  =  Met,  Arg,  Cys, 
Asp,  or  Glu),  Control  cells  were  cotransformed  with  the  two 
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FIG.  6.  The  two  forms  of  mouse  ATElp  can  implement  the  Asp/Glu-specific  subset  of  the  N-end  rule  pathway.  (A)  Relative  enzymatic  activities  of  Pgal  in  a/e/A 
S.  cerevisiae  transformed  with  plasmids  expressing  X-pgal  (as  Ub-X-Pgal)  test  proteins  (X  =  Met,  Arg,  Cys,  Asp,  or  Glu)  together  with  a  plasmid  expressing  either  yeast 
ATE}  (denoted  as  y),  mouse  ATEl-lp  (denoted  as  1),  mouse  ATEl-2p  (denoted  as  2),  or  the  vector  alone  (denoted  as  v).  The  N-terminal  residues  of  X-pgals  in  each 
set  of  experiments  are  indicated  at  the  top.  The  activity  of  Met-pgal  in  cells  transformed  with  vector  alone  is  taken  as  100%.  (B)  The  two  forms  of  mouse  ATEl  p  exhibit 
no  cooperativity  in  mediating  the  degradation  of  X-pgals.  Shown  are  relative  enzymatic  activities  of  X-pgals  in  ateJA  S.  cerevisiae  strain  cotransformed  with  plasmids 
expressing  X-pgals  (Ub-X-Pgals)  (X  =  Met,  Arg,  Cys,  Asp,  or  Glu)  and  the  combinations  of  plasmids  expressing  the  following  proteins:  mouse  ATEl-lp  and  ATEl-lp 
(1  +  1),  ATEl-lp  and  ATEl-2p  (1+2),  ATEl-2p  and  ATEl-2p  (2+2),  or  two  vector  controls  (v+v).  One  of  the  two  vectors  bore  the  TRl^l  marker  and  the  other  bore 
the  HIS3  marker  (see  Materials  and  Methods).  Results  are  averages  of  four  independent  measurements,  which  differed  by  less  than  10%. 


vectors  alone.  The  results  (Fig.  6B)  indicated  that  the  total  Mouse  ATEl-lp  and  ATEl-2p  destabilize  Asp-pgal  and 
activities  of  the  1  +  1  and  1+2  combinations  (measured  as  the  Glu-pgal  through  arginylation  of  their  N-terminal  residues, 

extent  of  destabilization  of  Asp-pgal  or  Glu-pgal)  were  similar  To  verify  directly  that  mouse  ATEl-lp  and  ATEl-2p  in  fact 

to  each  other  and  much  higher  than  the  total  activities  of  2+2  possess  the  R-transferase  activity,  we  constructed  the  atel^. 

(Fig.  6B),  consistent  with  the  conjecture  that  the  two  forms  of  ubrlL  S.  cerevisiae  double  mutant  AVY34,  which  lacked  both 

mouse  R-transferase  do  not  interact  and  that  ATEl-lp  is  a  R-transferase  and  N-recognin  (E3),  the  main  recognition  corn- 

more  (possibly  much  more)  active  enzyme  than  ATEl-2p.  ponent  of  the  N-end  rule  pathway  (see  Materials  and  Meth- 

Neither  ATEl-lp  nor  ATEl-2p  confers  metabolic  instability  ods).  Consequently,  N-terminal  arginylation  of  a  test  protein  in 

on  Cys-Pgal.  Cysteine  is  a  stabilizing  residue  in  the  yeast  N-end  this  mutant  by  an  exogenous  R-transferase  would  not  result  in 

rule  but  a  secondary  destabilizing  residue  in  multicellular  or-  degradation  of  the  protein,  thereby  making  it  possible  to  iso- 

ganisms  such  as  mammals  and  amphibians  (14, 18, 30, 54).  The  late  enough  of  the  test  protein  for  N-terminal  sequencing, 

presence  of  two  alternative  regions  in  the  two  forms  of  mouse  Strain  AVY34  was  transformed  with  pUB23-D  (expressing 

R-transferase  (Fig.  4C)  initially  suggested  that  one  R-trans-  Ub-Asp-pgal)  and  also  with  either  pATl  (expressing  ATEl- 

ferase  might  be  specific  for  N-terminal  Asp  and  Glu,  with  the  Ip),  pAT7  (expressing  ATEl-2p),  or  vector  alone  and  was 

other  specific  for  Cys.  However,  Cys-Pgal,  which  is  long-lived  grown  in  SG  medium.  Asp-Pgal  proteins  isolated  from  these 

in  wild-type  5.  cerevisiae  (5),  remained  long-lived  in  the  pres-  transformants  were  subjected  to  N-terminal  sequencing  (see 

ence  of  either  ATEl-lp  or  ATEl-2p  (Fig.  6A).  This  finding  Materials  and  Methods).  The  results  (Table  1)  directly  con- 

and,  more  directly,  the  results  of  amino  acid  sequencing  (see  firmed  that  both  ATEl-lp  and  ATEl-2p  possessed  R-trans- 

below)  suggest  the  existence  of  a  mammalian  tRNA-depen-  ferase  activity.  In  agreement  with  the  finding  that  ATEl-lp 

dent  enzyme  (presumably  a  distinct  R-transferase)  (18)  that  was  more  active  than  ATEl-2p  in  destabilizing  Asp-Pgal  in 

mediates  destabilizing  activity  of  N-terminal  Cys.  vivo  (Fig.  6A),  Asp-pgal  from  cells  expressing  ATEl-lp  was 
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TABLE  L  N-tcrminal  .sequencing  of  X-Pgal  proteins  isolated  from 
atclX  uhrll  S.  cerevisiae  expressing  different  R-transfcrascs 


Substrate 

Cocxprcsscd 

protein 

N-terminal 

.sequence 

Yield  ) 

D-c*^-pgal 

D-H-G-S-A- 

D-c*^-PgaI 

Vector  alone 

D-H-G-S-A- 

-100 

D-c^-pgal 

Mou.se  ATEl-lp 

R-D-H-G-S-A- 

-100 

D-c^-3gal 

Mouse  ATEl-2p 

R-D-H-G-S-A- 

~50 

D-H-G-S-A- 

-50 

C-c'^-pgal 

C-H-G-S-A- 

C-c*^-3gal 

Vector  alone 

C-H-G-S-A- 

~20 

C-c*^-|3gal 

Mouse  ATEl-lp 

C-H-G-S-A- 

~20 

C-c^-Pgal 

Mouse  ATEl-2p 

C-H-G-S-A- 

--20 

found  to  be  completely  arginylated,  whereas  Asp-pgal  from 
cells  expressing  ATEl-2p  was  arginylated  to  approximately 
50%  (Table  1). 

We  also  determined,  using  the  above  procedure,  whether 
mouse  ATE  1- Ip  or  ATEl-2p  could  arginylate  N-terminal  Cys. 
Approximately  80%  of  Cys-Pgal  isolated  from  atelA  ubrlA  S. 
cerevisiae  was  found  to  be  N-terminally  blocked,  presumably 
acctylated  (Table  1).  However,  the  rest  of  Cys-pgal  (~20%) 
bore  the  N-terminal  sequence  beginning  with  Cys  and  lacking 
N-terminal  Arg,  in  agreement  with  the  results  of  the  in  vivo 
Cys-Pgal  degradation  assays  (Fig.  6  and  Table  1).  Thus,  both 
ATE  1 -Ip  and  Atel-2p  are  apparently  unable  to  utilize  N- 
terminal  Cys  as  a  substrate  in  5.  cerevisiae, 

ATEl-2p  is  exclusively  cytosolic,  whereas  ATEl-lp  is 
present  in  either  the  nucleus  or  the  cytosol.  To  determine  the 
intracellular  location  of  the  two  forms  of  mouse  R-transferase, 


we  constructed  fusions  to  the  N  terminus  of  GFP  and  tran¬ 
siently  expressed  them  in  NIH  3T3  cells.  Whereas  the  free 
26-kDa  GFP  was  located  in  both  the  nucleus  and  the  cytosol 
(data  not  shown),  the  85-kDa  Atel-2p~GFP  fusion  was  exclu¬ 
sively  cytosolic  in  all  of  the  many  transfected  cells  examined 
(Fig.  7a  to  c).  In  contrast,  the  85-kDa  ATEl-lp-GFP  (the 
alternative  form  of  R-transferase  that  is  much  more  active 
enzymatically  than  ATEl-2p)  was  found  to  be  localized  differ¬ 
ently  in  different  cells  on  the  same  coverslip,  possibly  depend¬ 
ing  on  their  cell  cycle  position  and/or  metabolic  state.  Specif¬ 
ically,  in  ~50%  of  the  transfected  cells,  ATEl-lp-GFP  was 
exclusively  cytosolic  (Fig.  7d  and  e),  as  was  ATEl-2p-GFP 
(Fig.  7a  to  c),  but  in  the  other  -50%  of  cells,  ATEl-lp-GFP 
was  present  in  the  nucleus  as  well  and,  moreover,  appeared  to 
be  significantly  enriched  in  the  nucleus  (Fig.  7f  and  g).  Thus, 
the  two  43-residue  alternative  regions  in  ATEl-lp  and 
ATEl-2p  confer  overlapping  but  nonidentical  intracellular  dis¬ 
tributions  on  the  respective  R-transferases. 

While  the  nonuniformity  of  the  ATEl-lp-GFP  localization 
among  mouse  cells  in  a  single  culture  remains  to  be  under¬ 
stood,  its  preferential  location  in  the  nuclei  of  some  cells  is 
consistent  with  a  high  content  of  basic  residues  in  its  43-rcsidue 
region,  in  comparison  to  the  alternative  homologous  region  of 
ATEl-2p  (Fig.  2C).  (No  sequences  fitting  the  consensus  se¬ 
quences  of  known  nuclear  localization  signals  could  be  de¬ 
tected  in  the  43-residue  region  of  ATEl-lp).  In  contrast  to 
mouse  R-transferases,  S,  cerevisiae  Ate  Ip  was  shown  to  be 
located  predominantly  in  the  nuclei  of  yeast  cells  (56a). 

The  ratio  of  Atel-1  to  Ateh2  mRNA  varies  greatly  among 
mouse  tissues.  Northern  hybridization,  using  the  1.1 -kb  mouse 
Atel  cDNA  fragment  (nt  638  to  1734)  as  a  probe,  detected  a 
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FIG.  7.  Intracellular  localization  of  mouse  ATEl-lp  and  ATEl-2p.  Shown  are  green  (GFP)  fluorescence  (a  to  d  and  f)  and  phase-contrast  (e  and  g)  micrographs 
of  mouse  NIH  3T3  cells  transiently  transfected  with  ATT  1-1  p-GFP  (d  to  g)  or  ATEl-2i>-GFP  (a  to  c)  fusion  proteins  (see  Materials  and  Methods).  Panels  a  to  c  show 
different  examples  of  the  exclusively  cytosolic  localization  of  ATEl-2p.  Regions  around  the  nucleus  and  in  the  lamellar  protrusions  at  the  edges  of  a  cell  (c)  exhibit 
higher  GFP  fluorescence,  possibly  because  of  a  greater  thickness  of  cells  in  these  areas.  Panels  d  plus  e  and  f  plus  g  show  pairs  of  GFP  fluorescence  and  phase-contrast 
pictures  of  cells  that  express  AT^l-lp-GEP.  The  cell  in  panels  d  and  e  shows  ATEl-lp-GFP  in  the  cytosol  but  not  in  the  nucleus.  Cells  in  panels  f  and  g  contain 
ATEl-lp-GFP  in  both  the  cytosol  and  the  nucleus,  the  latter  being  apparently  enriched  in  ATEl-lp-GFP. 
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FIG.  8.  Northern  hybridization  analyses  of  mouse  AteJ  mRNA.  The  North¬ 
ern  blots  of  mRNA  from  different  mouse  tissues  were  probed  with  an  Ate]  cDNA 
fragment  (nt  638  to  1734)  which  can  hybridize  to  both  forms  oiAtel  mRNA.  A 
mouse  p-actin  cDNA  probe  was  used  for  comparing  the  total  RNA  loads  as 
described  previously  (19).  The  same  blot  was  also  hybridized  with  mouse  Vhrl 
(29)  (see  the  introduction).  The  apparent  absence  of  Atel  and  Vhrl  mRNAs 
from  the  spleen  is  an  artifact  of  RNA  degradation  in  this  lane  of  the  blot  (data 
not  shown).  Also  shown  arc  the  results  of  analogous  Northern  hybridizations  of 
the  mouse  NtanI  cDNA,  encoding  the  Asn-specific  Nt-amidase  (Fig.  lA),  and 
E2,4k^  encoding  the  relevant  Ub-conjugating  (E2)  enzyme  (19).  The  approxi¬ 
mate  sizes  of  transcripts  arc  indicated  on  the  right. 


single  ~5.0-kb  transcript  (a  mixture  of  XhtAtel-l  and  AteI-2 
mRNAs)  in  all  of  the  mouse  tissues  examined  except  the  testis, 
where  the  ~5-kb  Atel  mRNA  was  a  minor  one,  the  major 
species  being  —2  kb  (Fig.  8).  Both  ATElp  and  the  other 
targeting  components  of  the  mammalian  N-end  rule  pathway 
are  expressed  ubiquitously  (at  various  levels),  and  the  testis- 
specific  patterns  of  transcripts  are  characteristic  for  all  of  them 
as  well  (Fig.  8).  The  existence  of  the  Y-chromosome-encoded, 
testis-specific  variant  of  the  Ub-activating  (El)  enzyme  (27, 35) 
suggests  that  the  testis-specific  modifications  of  the  N-end  rule 
pathway  may  be  functionally  relevant  in  spermatogenesis. 

To  determine  the  ratio  of  Atel -1  io  Ate  1-2  mRNA  in  differ¬ 
ent  mouse  tissues  or  cells  in  culture,  we  employed  RT-PCR, 
using  sequence  differences  between  the  two  alternative,  ho¬ 
mologous  129-bp  exons  to  distinguish  between  them  (see  Ma¬ 
terials  and  Methods)  (Fig.  3).  Approximately  equal  amounts  of 
Atel-1  and  Atel-2  mRNAs  were  present  in  mouse  embryonic 
fibroblasts  and  in  human  293  cells  in  culture  (Fig.  3  and  data 
not  shown).  However,  the  molar  ratio  of  Atel-1  to  Atel-2 
mRNA  was  found  to  vary  greatly  among  the  mouse  tissues:  it 
was  ~0.1  in  the  skeletal  muscle,  ^0.25  in  the  spleen,  —3.3  in 
the  liver,  and  brain,  and  —10  in  the  testis  (Fig.  3).  Thus,  while 
the  total  expression  of  Atel  (Atel-1  plus  Atel-2)  varies  by  2-  to 
4-fold  among  mouse  tissues  (Fig.  8),  the  difference  in  expres¬ 
sion  levels  between  Atel-1  and  Atel-2  mRNAs  can  be  as  high 
as  a  100-fold  (the  skeletal  muscle  versus  the  testis)  (Fig.  3), 
suggesting  that  the  two  forms  of  R-transferase  may  be  func¬ 
tionally  distinct. 


DISCUSSION 

The  N-end  rule  pathway  is  one  of  several  proteolytic  path¬ 
ways  of  the  Ub  system  (23,  54,  55).  Among  the  targets  of  the 
N-end  rule  pathway  are  proteins  that  bear  destabilizing  N- 
terminal  residues.  In  the  yeast  5,  cerevisiae,  Asn  and  Gin  are 
tertiary  destabilizing  N-terminal  residues  in  that  they  function 
through  their  conversion,  by  a  specific  amidase  (6),  to  the 
secondary  destabilizing  N-terminal  residues  Asp  and  Glu.  The 
destabilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation,  by  the /ITEi -encoded  R-transferase,  to  Arg,  one 
of  the  primary  destabilizing  residues  (7)  (Fig,  IB).  In  mam¬ 
mals,  the  set  of  secondary  destabilizing  residues  contains  not 
only  Asp  and  Glu  but  also  Cys,  which  is  a  stabilizing  residue  in 
yeast  (18,  54)  (Fig.  1). 

In  this  work,  we  isolated  cDNA  encoding  the  mouse  R- 
transferase,  ATElp,  and  found  that  this  enzyme  exists  in  two 
forms,  termed  ATEl-lp  and  ATEl-2p,  which  differ  by  con¬ 
taining  one  of  the  two  alternative,  homologous  43-residue  re¬ 
gions.  The  two  516-residue  R-transferases  are  produced  from 
the  mouse  Atel  gene  by  a  pathway  of  alternative  splicing  that 
retains  one  or  the  other  of  the  two  homologous  129-bp  exons. 
The  presence  of  two  adjacent,  homologous,  equal-length,  and 
alternatively  utilized  exons  in  a  gene  (Fig.  4)  is  nearly  unprec¬ 
edented.  To  our  knowledge,  just  one  such  case  was  described 
previously:  the  mouse  kE2  enhancer-binding  protein  E12/E47 
(37).  The  two  KE2-binding  proteins,  E12  and  E47,  arc  pro¬ 
duced  through  a  switch  between  two  alternative,  equal-length 
exons,  resulting  in  tw'o  helix-loop-helix  DNA-binding  proteins 
that  differ  in  the  ability  to  homodimerize.  Specifically,  E47  can 
bind  to  the  kE2  enhancer  either  as  a  homodimer  or  as  a 
heterodimer  with  MyoD,  whereas  E12  can  bind  as  a  het¬ 
erodimer  with  MyoD  but  not  as  a  homodimer  (37). 

We  report  the  following  major  findings. 

(i)  Identification,  through  species  walking,  and  isolation  of 
the  mouse  cDNA  encoding  R-transferase  (or  ATElp)  have 
shown  that  mammalian  ATElp  exists  in  two  forms,  ATEl-lp 
and  ATEl-2p,  which  differ  exclusively  by  one  of  the  two  alter¬ 
native,  homologous  43-residue  regions  (Fig,  2A). 

(ii)  The  corresponding  alternative  129-bp  exons  are  adjacent 
in  the  mouse  Atel  gene.  Moreover,  the  12-bp  sequences 
around  the  splice  acceptor  sites  of  these  exons  (6  bp  in  the 
intron  and  6  bp  in  the  exon)  are  identical  between  the  two 
exons  (Fig.  4).  The  splicing  of  Atel  pre-mRNA  proceeds  in 
such  way  that  one,  and  only  one,  of  the  alternative  129-bp 
exons  is  always  retained  in  the  mature  Atel  mRNA. 

(iii)  The  human  >1 7^7  gene  also  contains  the  two  alternative 
129-bp  exons,  whereas  the  plant  (A.  thaliana)  and  fly  {D.  mela- 
nogaster)  Atel  genes  encode  a  single  form  of  ATElp  (Fig.  2 
and  5).  The  corresponding  43-residue  regions  are  significantly 
similar  among  all  of  the  sequenced  R-transferases,  from  S. 
cerevisiae  to  mammals  (Fig.  2C).  The  set  of  Atel  genes  from 
mammals  to  yeast  defines  a  distinct  family  of  proteins,  the 
ATE  family.  The  splicing-derived  alternative  forms  of  R-trans¬ 
ferase  have  evolved  apparently  after  the  divergence  of  the 
arthropod  and  vertebrate  lineages. 

(iv)  Expression  of  the  mouse  Atel-1  and  Atel-2  cDNAs  in 
atel^  S.  cerevisiae,  and  N-terminal  sequencing  of  isolated 
X-pgal  test  proteins,  was  used  to  show  that  ATEl-lp  and 
ATEl-2p  could  implement  the  Asp/Glu-specific  subset  of  the 
N-end  rule  pathway  and  that  they  did  so  through  the  argin- 
ylation  of  N-terminal  Asp  or  Glu  in  the  test  substrates  (Fig.  6A 
and  Table  1). 

(v)  While  the  destabilizing  activity  of  the  mouse  ATEl-lp 
R-transferase  is  only  slightly  lower  than  that  of  S.  cerevisiae 
R-transferase,  the  activity  of  mouse  ATEl-2p  is  significantly 
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(possibly  considerably)  lower  than  that  of  ATE  1 -Ip.  This  con¬ 
clusion  follows  also  from  a  comparison  of  the  N-terminal  ar- 
ginylation  of  Asp-pgal  by  the  two  R-transferases  (Table  1). 
The  results  of  coexpressing  mouse  ATEl-lp  and  ATEl-2p  in 
the  same  «/c/A  yeast  cells  were  consistent  with  the  conjecture 
that  R-transferasc  functions  as  a  monomer  (Fig.  6B). 

(vi)  Neither  ATEl-lp  nor  ATEl-2p  could  confer  instability 
on  (or  arginylatc)  Cys-Pgal  in  atel^  S.  cerevisiae  (Fig.  6A  and 
Table  1).  Cys  is  a  stabilizing  residue  in  yeast  but  a  secondary 
destabilizing  residue  in  the  mammalian  N-end  rule  (54).  A 
distinct  Cys-specific  mammalian  R-transferase  suggested  by 
these  data  remains  to  be  identified. 

(vii)  Mouse  ATEl-2p  (tested  as  a  GFP  fusion)  was  exclu¬ 
sively  cytosolic  in  mouse  3T3  cells,  whereas  ATEl-lp  was  lo¬ 
calized  differentially  in  different  cells  of  the  same  (unsynchro- 
nized)  culture:  it  was  either  exclusively  cytosolic  or  present  in 
both  the  cytosol  and  the  nucleus  (Fig.  7). 

(viii)  Mouse  is  a  ubiquitously  expressed  gene.  A  single 
'^^-kb  mRNA  was  present  in  all  of  the  tissues  examined  except 
the  testis,  where  the  major transcript  was  ~2  kb  in  length 
(Fig.  8).  The  testis-specific  differential  expression  patterns  are 
also  characteristic  of  the  other  targeting  components  of  the 
mammalian  N-end  rule  pathway,  such  as  the  7 -encoded 
Asn-spccific  Nt-amidase  and  the  Ubr I -encoded  N-recognin 
(E3a)  (19,  29). 

(ix)  The  molar  ratio  of  Atel-I  to  AteI-2  mRNA  varies  up  to 
a  100-fold  among  different  mouse  tissues  (Fig.  3  and  Results), 
suggesting  a  functional  significance  of  the  difference  between 
the  two  R-transfcrascs. 

The  region  of  ATE  Ip  that  corresponds  to  the  two  129-bp 
mammalian  AteJ  exons  has  been  significantly  conserved 
throughout  eukaryotic  evolution,  Tyr-296,  Gln-297,  and  His- 
301  of  the  mouse  ATEl-lp  being  among  the  most  highly  con¬ 
sented  residues  (Fig.  2C).  No  putative  members  of  the  ATE 
family  could  be  detected  among  the  currently  known  prokary¬ 
otic  ORFs.  The  most  highly  conserved  region  of  R-transferases 
is  an  82-residue  stretch  (residues  336  to  417)  of  mouse  ATElp: 
this  region  is  95,  76,  and  63%  identical  to  the  corresponding 
regions  of  the  human,  D.  melanogaster,  and  A.  thaliana  ATElp, 
respectively  (Fig.  2B).  A  Cys  residue(s)  is  likely  to  be  a  com¬ 
ponent  of  the  active  site  of  R-transferase  (31, 32).  Among  the 
five  fully  conserved  Cys  residues  in  proteins  of  the  ATE  family, 
four  are  located  in  the  56-residue  N-terminal  region  (residues 
23  to  78  of  mouse  ATElp)  (Fig.  2B).  Conversion  of  some  of 
these  cysteines  in  5.  cerevisiae  Ate  Ip  to  alanines  was  found  to 
decrease  greatly  the  R-transferase  activity  of  yeast  Atelp  (16a, 
32).  Furthermore,  derivatives  of  mouse  ATEl-lp  and  ATEl-2p 
that  lacked  the  first  42  residues  were  completely  inactive  in  the 
yeast-based  Asp-Pgal  degradation  assay  of  a  kind  described  in 
Fig.  7  (data  not  shown).  Finally,  a  90-residue  C-terminal  trun¬ 
cation  of  5.  cerevisiae  Atelp  did  not  result  in  a  major  decrease 
of  its  R-transferase  activity  (16a).  Thus,  the  active  site  of  R- 
transferase  is  likely  to  encompass  at  least  some  of  the  above 
N-terminal  cysteines. 

Since  the  two  mammalian  R-transferases  (Fig.  4C)  are  iden¬ 
tical  in  size  and,  except  for  a  43-residue  region,  are  identical 
otherwise  as  well,  it  is  likely  that  the  previously  described 
(partially  purified)  mammalian  R-transferases  (12, 47)  were  in 
fact  mbetures  of  Atel-lp  and  Atel-2p.  On  the  other  hand, 
fractionation  of  a  crude  R-transferase  preparation  from  rabbit 
reticulocytes  did  yield,  in  addition  to  a  major  fraction  of  R- 
transferase,  chromatographically  distinct  R-transferase  frac¬ 
tions  that  were  not  investigated  further  (12).  The  ratio  of 
Atel-lp  to  Atel-2p  in  rabbit  reticulocytes  is  currently  un¬ 
known. 
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exon  of  ATEl-lp  with  the  alternative  129-bp  exon  results  in  a 
protein,  ATEl-2p,  that  has  a  significantly  (possibly  consider¬ 
ably)  lower  R-transferase  activity  (Fig.  6).  In  addition, 
ATEl-2p  is  unable  to  enter  the  nucleus  (as  a  GFP  fusion),  in 
contrast  to  ATEl-lp  (Fig.  7).  Taken  together  with  the  finding 
that  the  expression  ratio  of  the  two^/c7  mRNAs,  Atel-I  and 
Atel -2,  varies  up  to  a  100-fold  among  different  mouse  tissues 
(Fig.  3  and  Results),  these  data  suggest  that  the  two  R-trans¬ 
ferases  are  functionally  distinct  as  well.  Cited  below  are  some 
of  the  possibilities  that  arc  consistent  with  the  available  evi¬ 
dence. 

Mouse  ATEl-2p  has  the  R-transferase  activity  but  arginyl- 
ates,  at  steady  state,  only  ~50%  of  Asp-pgal  in  a/c7A  S.  cer¬ 
evisiae  ^  in  contrast  to  both  ATEl-2p  and  S.  cerevisiae  Atelp 
(Fig.  6  and  Table  1).  Moreover,  the  inefficient  arginylation  by 
ATEl-2p  occurs  in  spite  of  its  overexpression  in  S.  cerevisiae. 
In  contrast,  the  yeast  Atelp,  which  in  wild-type  S.  cerevisiae  is 
a  weakly  expressed  protein  (7),  can  quantitatively  arginylatc  in 
vivo  an  overexpressed  substrate  such  as  Asp-Pgal  (18).  Thus,  at 
a  low  level  of  expression  (which  is  likely  to  be  the  case  in  the 
mouse),  the  ATEl-2p  R-transferasc  may  be,  in  effect,  an  in¬ 
active  enzyme,  in  contrast  to  ATEl-lp.  If  so,  ATE]-2p  might 
act  as  an  (indirect)  inhibitor  of  the  ATEl-lp  function,  for 
example,  through  a  competition  with  ATEl-lp  for  the  binding 
to  a  component  of  the  targeting  complex  in  the  N-end  rule 
pathway.  (The  apparent  absence  of  such  competition  in  S. 
cerevisiae  [Fig.  6B]  may  result  from  the  lack  of  binding  by 
ATElp  to  heterologous  yeast  proteins.)  It  is  also  possible  that 
a  large  difference  in  activity  between  mouse  ATEl-lp  and 
ATEl-2p  in  yeast  reflects  not  their  different  enzymatic  activi¬ 
ties  in  the  mouse  but  a  (physiologically  irrelevant)  differential 
recognition  of  an  essential  yeast  cofactor  such  as  Arg-tRNA. 
Direct  comparisons  of  arginylation  kinetics  by  the  purified 
mouse  and  yeast  R-transferascs  will  be  required  to  address  this 
unlikely  but  unexcludcd  interpretation. 

Another  possibility  is  that  ATEl-2p  has  a  distinct  enzymatic 
activity  that  has  been  missed  by  the  current  N-tcrminal  ar¬ 
ginylation  assay  (Fig.  6  and  Table  1).  For  example,  ATEl-2p 
might  be  able  to  arginylatc  an  internal  residue  in  a  substrate 
protein.  In  vitro  enzymological  dissection  of  ATEl-lp  and 
ATEl-2p  will  address  this  and  related  conjectures.  Yet  an¬ 
other  possibility  is  that  the  alternative  43-residue  regions  of 
ATEl-lp  and  ATEl-2p  confer  different  metabolic  stabilities 
on  the  two  R-transferases,  the  lower  apparent  activity  of 
ATEl-2p  in  yeast  being  due  at  least  in  part  to  its  shorter 
half-life.  A  test  of  this  model  in  mouse  cells  requires  antibodies 
specific  for  the  alternative  regions  of  the  two  R-transferases; 
preparation  of  such  antibodies  is  under  way. 

In  yeast,  the  N-end  rule  pathway  is  present  in  both  the 
cytosol  and  the  nucleus.  The  apparent  exclusion  of  mouse 
ATEl-2p  from  the  nucleus  and  the  different  ratios  of  Atel-1  to 
Atel-2  mRNA  among  the  mouse  tissues  suggest  that  the  rule 
book  of  the  N-end  rule  pathway  may  be  regulated  differentially 
in  the  cytosol  and  the  nucleus,  through  a  cell-type-specific 
e?q)ression  of  the  pathway’s  components  that  are  located  in  one 
but  not  the  other  compartment. 

Physiological  substrates  for  either  eukaryotic  R-transferases 
(54)  or  their  prokaryotic  counterparts,  L,  F-transferases  (1, 25, 
45)  are  not  known.  The  cloning  and  characterization  of  the  first 
mammalian  Atel  cDNAs  and  genes  (Fig.  2),  and  the  discov¬ 
ery  of  alternative  splicing  that  yields  mouse  ATEl-lp  and 
ATEl-2p  (Fig,  4)  should  facilitate  understanding  of  the  func¬ 
tions  of  mammalian  R-transferases,  in  part  through  the  anal¬ 
ysis  of  ATEl-lp  and  ATEI-2p  enzymes  and  also  because  it  is 
now  possible  to  construct  mouse  strains  that  lack  ATEl-lp 
and/or  ATEl-2p. 
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The  split-ubiquitin  technique  was  used  to  detect  transient  protein  interactions  in  living 
cells.  Nyb,  the  N-terminal  half  of  ubiquitin  (Ub),  was  fused  to  Sec62p,  a  component  of  the 
protein  translocation  machinery  in  the  endoplasmic  reticulum  of  Saccharomyces  cercvisiac. 
Cub/  the  C-terminal  half  of  Ub,  was  fused  to  the  C  terminus  of  a  signal  sequence.  The 
reconstitution  of  a  quasi-native  Ub  structure  from  the  two  halves  of  Ub,  and  the  resulting 
cleavage  by  Ub-specific  proteases  at  the  C  terminus  of  C^b/  serve  as  a  gauge  of  proximity 
between  the  two  test  proteins  linked  to  N^b  and  C^b*  Using  this  assay,  we  show  that 
Sec62p  is  spatially  close  to  the  signal  sequence  of  the  prepro-c^-factor  in  vivo.  This 
proximity  is  confined  to  the  nascent  polypeptide  chain  immediately  following  the  signal 
sequence.  In  addition,  the  extent  of  proximity  depends  on  the  nature  of  the  signal 
sequence.  C^b  fusions  that  bore  the  signal  sequence  of  invertase  resulted  in  a  much  lower 
Ub  reconstitution  with  Nub-Sec62p  than  otherwise  identical  test  proteins  bearing  the 
signal  sequence  of  prepro-a-factor.  An  inactive  derivative  of  Sec62p  failed  to  interact 
with  signal  sequences  in  this  assay.  These  in  vivo  findings  are  consistent  with  Sec62p 
being  part  of  a  signal  sequence-binding  complex. 


INTRODUCTION 

A  critical  step  during  the  translocation  of  a  protein 
across  the  membrane  of  the  endoplasmic  reticulum 
(ER)  is  the  interaction  between  the  signal  sequence  of 
a  nascent  polypeptide  and  its  receptors  (Walter  et  al, 
1981;  Gilmore  and  Blobel,  1985;  Walter  and  Johnson, 
1994).  A  stretch  of  8  to  12  hydrophobic  residues,  often 
at  the  N  terminus  of  a  protein,  comprises  a  signal 
sequence  that  is  sufficient  to  initiate  the  protein's 
translocation  into  the  endoplasmic  reticulum  (ER) 
(Rapoport  et  al,  1996).  To  be  compatible  with  a  high 
flux  of  polypeptides  through  a  limited  number  of 
translocation  channels  in  the  ER  membrane,  the  inter¬ 
action  between  the  signal  sequence  and  its  receptors 
has  to  be  short  lived.  Its  transient  nature  makes  such  a 
receptor-ligand  interaction  difficult  to  study,  espe¬ 
cially  in  living  cells.  The  approaches  used  for  the 
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analysis  of  protein  translocation  in  cell-free  systems 
circumvent  the  transience  of  the  signal  sequence-re¬ 
ceptor  interaction  by  pausing  or  stopping  the  synthe¬ 
sis  of  a  nascent  polypeptide  chain  at  different  stages  of 
its  movement  to  and  across  the  ER  membrane  (Krieg  et 
al,  1986;  Kurzchalia  et  ah,  1986;  Connolly  et  al,  1989). 
Given  these  constraints,  it  is  essential  to  verify  in  vivo 
the  models  derived  from  in  vitro  studies.  The  ability  to 
analyze  early  translocation  events  in  vivo  should  also 
be  important  for  defining  the  immediate  environment 
of  the  nascent  chain  on  its  path  from  the  ribosome  to 
the  ER  membrane. 

Most  of  the  current  methods  for  detecting  protein  in¬ 
teractions  in  vivo  either  do  not  operate  at  the  ER  mem¬ 
brane  or  are  unable  to  detect  a  transient  proximity  be¬ 
tween  proteins  (Fields  and  Song,  1989;  Aronheim  et  al, 
1997;  Rossi  et  al,  1997;  Miyawaki  et  al,  1997).  In  the 
present  work,  we  show  that  the  previously  developed 
split-ubiquitin  (split-Ub)  technique,  also  called  USES 
(Ub/split/protein/sensor)  (Johnsson  and  Varshavsky, 
1994a,  199^,  is  capable  of  detecting  a  transient  in  vivo 
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Figure  1.  The  split-Ub  technique  and  its  application  to  the  analysis 
of  protein  translocation.  (A)  and  C^,t,  are  linked  to  the  interact¬ 
ing  proteins  X  and  Y.  The  X-Y  complex  brings  N^b  and  Qb  into 
close  proximity.  N^b  and  C^b  reconstitute  a  quasi-native  Ub  moiety, 
which  is  cleaved  by  the  UBPs,  yielding  the  free  reporter  R  (Johnsson 
and  Varshavsky,  1994a).  (B)  Using  split-Ub  to  monitor  the  proximity 
between  a  secretory  protein  and  a  component  of  the  translocation 
machinery.  A  signal  sequence-bearing  C^b  fusion  (SS-Y-Cub'R)  and 
a  N^b  fusion  (N^b-X)  are  coexpressed  in  a  cell.  Pathway  1:  when  N^b 
is  linked  to  a  protein  not  involved  in  the  targeting  for  translocation, 
the  uncleaved  (except  for  the  signal  sequence  SS)  -R  enters  the 
ER.  Pathway  2:  when  N„b  is  linked  to  a  protein  involved  in  the 
targeting  for  translocation  the  signal  sequence  of  the  SS-Y-Cyb-R 
brings  N^b  and  C^b  into  close  contact.  As  a  result,  some  of  the 
SS-Y-Cub-R  and  N^b'X  molecules  interact  to  form  the  quasi-native 
Ub,  yielding  the  free  reporter  R  in  the  cytosol. 

interaction  between  polypeptides.  The  split-Ub  method 
is  based  on  the  ability  of  and  the  N-  and 
C-terminal  halves  of  Ub,  to  assemble  into  a  quasi-native 
Ub.  Ub-specific  proteases  (UBPs),  which  are  present  in 
all  eukaryotic  cells,  recognize  the  reconstituted  Ub,  but 
not  its  halves,  and  cleave  the  Ub  moiety  off  a  reporter 
protein  that  had  been  linked  to  the  C  terminus  of  Cub- 
The  liberation  of  the  reporter  serves  as  a  readout  indi¬ 
cating  the  reconstitution  of  Ub.  The  assay  is  designed  in 
a  way  that  prevents  efficient  association  of  Nub  Cub 
by  themselves,  but  allows  it  if  the  two  Ub  halves  are 
separately  linked  to  proteins  that  interact  in  vivo  (Figure 
lA).  The  split-Ub  assay  has  been  shown  to  detect  the  in 
vivo  dimerization  of  a  leucine  zipper-containing  domain 
of  the  Gcn4p  transcriptional  activator,  and  the  in  vivo 
interaction  between  two  subunits  of  the  oligosaccharyl- 
transferase  complex  (Johnsson  and  Varshavsky,  1994a; 
Stagljar  et  al,  1998). 

In  the  present  work,  we  focus  on  the  interaction 
between  Sec62p  of  the  yeast  Saccharomyces  cerevisiae 


and  proteins  bearing  two  different  signal  sequences. 
Extensive  evidence  indicates  that  Sec62p  is  a  compo¬ 
nent  of  the  ER  translocation  machinery  (Deshaies  and 
Schekman,  1989;  Rothblatt  et  aL,  1989;  Miisch  et  at., 
1992).  Sec62p  is  a  part  of  the  tetrameric  Sec62/63 
complex  that  also  contains  Sec71p  and  Sec72p  (De¬ 
shaies  et  aL,  1991;  Feldheim  and  Schekman,  1994). 
Sec62/ 63p  can  be  isolated  as  a  tetramer,  or  as  a  part 
of  a  larger  assembly,  the  heptameric  Sec  complex 
(Panzner  et  aL,  1995).  In  addition  to  the  Sec62/63 
complex,  the  heptamer  contains  the  trimer  of  Sec61p. 
This  trimer  (Sec61p,  Ssslp,  Sbhlp  in  yeast;  Sec61a, 
Sec61j8,  Sec6l7  in  mammals)  forms  the  aqueous  chan¬ 
nel  through  which  a  polypeptide  chain  is  translocated 
across  the  ER  membrane  (Simon  and  Blobel,  1991; 
Gorlich  et  at.,  1992;  Crowley  et  aL,  1993,  1994;  Mothes 
et  aL,  1994;  Hanein  et  aL,  1996;  Beckmann  et  aL,  1997). 

The  role  of  the  Sec62/63  tetramer  is  less  well  de¬ 
fined.  Cross-linking  and  reconstitution  experiments  in 
vitro  have  shown  that  Sec62p  is  close  to  the  nascent 
polypeptide  chain  before  the  initiation  of  its  translo¬ 
cation  (Musch  et  aL,  1992;  Lyman  and  Schekman,  1997; 
Matlack  et  aL,  1997).  One  important  role  of  Sec63p  is  its 
ability  to  recruit  the  Hsp70-type  protein  Kar2p  of  the 
ER  lumen  to  the  vicinity  of  a  translocating  polypeptide 
(Brodsky  and  Schekman,  1993;  Lyman  and  Schekman, 
1997).  TTie  Sec62/63  complex  is  essential  for  the  post- 
translational  translocation  of  proteins  in  reconstituted 
vesicle  preparations  (Panzner  et  aL,  1995).  Genetic 
analysis  supports  this  conclusion,  by  showing  that  the 
tetrameric  Sec62/63  complex  is  involved  in  the  trans¬ 
location  of  proteins  whose  targeting  to  the  ER  mem¬ 
brane  is  not  abolished  by  the  loss  of  the  signal  recog¬ 
nition  particle  (SRP)  (Ng  et  aL,  1996).  However,  it  is 
less  clear  whether  the  Sec62/63  complex  is  the  recep¬ 
tor  for  the  signal  sequences  of  those  proteins. 

In  the  present  work,  we  demonstrate  the  ability  of 
the  split-Ub  assay  to  detect  transient  protein  interac¬ 
tions  in  living  cells.  We  show  that  the  assay  can  mon¬ 
itor  a  close  proximity  between  Sec62p  and  a  segment 
of  the  nascent  chain  of  a  signal  sequence-bearing  pro¬ 
tein.  The  apparent  extent  of  this  proximity  is  influ¬ 
enced  by  the  nature  of  the  signal  sequence  and  the 
position  of  Cub  Ihe  nascent  polypeptide.  Our  anal¬ 
ysis  yields  a  crude  map  of  the  environment  of  the 
nascent  chain  during  its  targeting  to  and  translocation 
across  the  ER  membrane.  Taken  together,  these  find¬ 
ings  are  the  first  in  vivo  evidence  that  Sec62p,  a  com¬ 
ponent  of  the  translocation  machinery  in  the  endo¬ 
plasmic  reticulum,  is  a  part  of  a  signal  sequence¬ 
binding  complex. 

MATERIALS  AND  METHODS 
Construction  of  Test  Proteins 

The  Cub  fusions  8-13  (Figure  2)  were  derived  from  the  construct  I  of 
Johnsson  and  Varshavsky  (1994a),  which  encoded  Ub-DHFR*ha  and 
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A  Nub  Fusions 


1.  N„5-Sec62p 

2.  Nui]-Sec62(^Qeo)P 

3.  Nui,-Sec61p 

4.  Nub-Ste14p 

5.  Nyb*Bos1p 

6.  Nyb'SedSp 

7.  Nub-Tpilp 


Nubu-^ga-C  Golgi 


Nub 


B  Cub  Fusions 


8.  Mfal37Cub*R 

9.  Mf(x1 

10.  Suc223Cub*R 

11.  Suc233C|jb"R 

12.  Suc259C|jb~R 

13.  Suc2g-|  gC||b*R 

14.  Tpl1Cub-R 

15.  ASuc223C|jb*R 

16.  ASuc259Cub“R 


XZIZIZCub^^ 

irz2:z2i2  Cub^^ 


Figure  2.  Nub  ^nd  Cub  fusions.  (A)  Nub  (residues  1-36  of  Ub)  was  fused  to  the  N  terminus  of  either  a  transmembrane  protein  (constructs 
1-6)  or  a  cytosolic  protein  (construct  7).  The  N  termini  of  all  proteins  are  located  in  the  cytosol.  The  orientations  and  the  numbers  of 
membrane-spanning  regions  (shaded  boxes)  were  derived  from  the  published  studies  of  these  proteins,  except  for  Stel4p,  for  which  the  exact 
number  of  the  domains  and  the  localization  of  the  C  terminus  are  not  yet  known.  The  Nub  fusions  1-5  are  located  in  the  membrane  of  the 
ER;  the  Nub  fusion  6  resides  in  the  membrane  of  the  early  Golgi.  The  Nyb  fusion  2  is  a  Sec62p  derivative  lacking  the  C-terminal  60  residues. 
The  Nub  fusion  7  contains  the  full-length  triosephosphate  isomerase  (Nyb-Tpilp).  (B)  Cub  fusions.  The  Cub  fusions  8  and  9  contain  the  signal 
sequence  of  the  prepro-a-factor  (shaded  boxes),  followed  by  either  37  (construct  8)  or  65  residues  (construct  9)  of  the  mature  a-factor  sequence 
(striped  boxes)  and  a  7-residue  linker  sequence  (not  shown).  Cub  fusions  10-13  contain  the  signal  sequence  of  the  Suc2p  invertase  (dark  boxes) 
followed  by  23  (construct  10),  33  (construct  11),  59  (construct  12),  or  518  residues  (construct  13)  of  the  mature  sequence  of  invertase  (open 
boxes)  and  a  7-residue  linker  sequence  (not  shown).  The  Cub  fusion  14  contains  the  complete  sequence  of  S.  cerevisiae  triosephosphate 
isomerase  (Tpilp)  followed  by  a  17-residue  linker  peptide  and  Cub-  The  Cub  fusions  15  and  16  are  the  signal  sequence-lacking  counterparts 
of  the  fusions  10  and  12.  Cub  is  always  followed  by  a  reporter  protein.  The  reporter  is  DHFR-ha  or  Ura3p  for  the  Cub  fusions  8-13,  and 
DHFR-ha  for  the  Cub  fusions  14,  15,  and  16. 


contained  a  BamHl  site  at  the  amino  acid  position  36  of  Ub,  and 
from  the  previously  described  Ub  translocation  constructs  I,  VI,  IX, 
X,  XXIII,  and  XXV  (Johnsson  and  Varshavsky,  1994b).  The  above 
BamHl  site  of  the  Ub-DHFR-ha  construct  I  was  fused  to  a  linker 
sequence  in  which  a  5' -Sail  site  allowed  the  in-frame  insertion  of  an 
Eagl-Sall  fragment  containing  the  promoter,  the  signal  sequence, 
and  a  portion  of  the  mature  sequence  of  the  corresponding  Ub 
fusions.  The  newly  introduced  sequence  was  G  TCG  ACC  ATG 
TCG  GGG  GGG  ATC  CCT.  The  last  three  triplets  encode  residues 
35,  36,  and  37  of  Ub  (the  beginning  of  Cub)-  The  underlined  se¬ 
quences  are  the  Sail  and  BamHl  sites,  respectively.  The  final  con¬ 
structs  were  in  the  single-copy  plasmids  pl&314  or  pRS315  (Sikorski 
and  Hieter,  1989).  Expression  of  the  Cub  fusions  bearing  Dha 
(DHFR-ha)  as  a  reporter  was  mediated  by  the  Padhi  promoter, 
except  for  the  Cub  fusion  14,  which  was  expressed  from  the  Pcupi 
promoter.  The  same  promoter  was  used  for  expressing  the  Ura3p- 
based  Cub  fusions. 

The  Cub  fusions  15  and  16  (Figure  2)  were  derived  from  constructs 
10  and  12  by  deleting  the  Hiudlll  fragment  spanning  the  first  four 
codons  of  the  SUC2  ORF  and  a  short  portion  of  the  polylinker 
sequence  between  the  3 '-end  of  the  Padhi  promoter  and  the  SUC2 
ORF.  As  a  result,  the  translation  of  Cub  fusions  15  and  16  began  at 
the  first  codon  of  the  mature  invertase,  skipping  its  signal  sequence. 
The  Cub  fusion  14  (Figure  2)  was  produced  through  an  in-frame 
fusion  of  a  PCR  fragment  containing  the  complete  TPll  coding 


sequence  and  Cub'Dha.  The  sequence  between  TPll  and  Cub  is  as 
follows:  AAC  GGG  TCG  ACC  GAC  TAC  AAG  GAC  GAC  GAT 
GAC  AAG  GGC  TCG  ACC  ATGTCG  GGG  GGG  ATC  CCT.  The 
underlined  sequences  indicate,  respectively,  the  last  codon  of  TPll 
and  the  first  three  codons  of  Cub- 
A  fragment  encoding  Nub“Sec62p  was  constructed  using  PCR 
amplification  of  a  1050  base  pair  (bp)  fragment  containing  the  SEC62 
ORF.  PCR  introduced  a  BamHl  site  and  a  linker  sequence  in  front  of 
the  start  codon  of  SEC62  and  an  Xhol  site  173  bp  downstream  of  the 
stop  codon.  The  Pcupr^ub  modules  were  cloned  as  BamHl  frag¬ 
ments  in  frame  with  the  SEC62  ORF.  The  sequence  between  Nyb  and 
SEC62  is  GGG  ATC  CCT  TCT  GGG  ATG.  The  first  three  codons 
encode  residues  35, 36,  and  37  of  Nub.  followed  by  the  Gly-Ser  linker 
and  the  start  codon  of  SEC62.  The  BamHl  site  is  underlined.  The 
final  constructs  resided  in  pRS316  or  pRS313.  Nub-TP/f.  Nub-S£D5, 
Nub-STEI4  (a  gift  from  N.  Lewke),  and  Nub-Sec62(AC60)-Dha  were 
constructed  similarly  to  N„b-SEC62.  With  the  exception  of  Nyb- 
Sec62(AC60)-Dha,  which  was  placed  in  pRS316  and  pRS313,  all  of 
these  fusions  resided  in  pRS314.  The  linker  connecting  codon  35  of 
Nub  fho  first  codon  of  a  linked  gene  was  GGG  ATC  CCT  GGG 
GAT  ATG  for  Nyb-TPH  and  Nub-SED5,  and  GGG  ATC  CCT  GGG 
GAT  CAC  for  Nyb-STEH.  Underlined  are  the  BamHl  site  and  the 
first  codon  of  the  linked  gene.  The  sequence  GGG  TCG  ACC  TTA 
ATG  CAG  AGA  TCT  GGC  ATC  ATGGTT  connected  the  last  codon 
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Table  1.  Yeast  strains 

Strain 

Relevant  genotype 

Source /comment 

YPH500 

MATa  ade2-101  his3-A200  ku2-A1  lys2-801  trpl-A63  ura3-52 

Sikorski  and  Hieter  (1989) 

JD53 

MATa  his3-A200  kii2-3,lU  lys2-80i  trpl-A63  ura3-52 

Dohmen  et  al.  (1995) 

RSY529 

MATa  his4  ku2-3,m  ura3-52  sec62-l 

Deshaies  and  Schekman  (1989) 

NJY51 

MATa  ade2-Wl  his3-A200  ku2-A1  ]ys2-801  trpl-A63  urci3-52 
NUB-BOSl::pRS306 

Derivative  of  YPH500 

NJY62-1 

MATa  his3-A200  ku2-3,n2  lys2-801  trpl-A63  ura3-52 
NUB-SEC62r.pRS306 

Derivative  of  JD53 

NJY73-I 

MATa  his3-A200  ku2-3,112  lys2-801  trpl-A63  ura3-52 
NUB-BOSl::pRS303 

Derivative  of  JD53 

NJY74-I 

MATa  his3-A200  ku2-3,112  Iys2-801  trpl-A63  ura3-52 
NUB-BOSl::pRS306 

Derivative  of  JD53 

NJY61-I 

MATa  his3-A200  ku2-3,112  Iys2-801  trpl-A63  ura3-52 
NUB-SEC61::pRS304 

Derivative  of  JD53 

NJY61-A 

MATa  his3-A200  ku2-3,in  Iys2-80-I  trp1-A63  ura3-52 
NUA-SEC61::pRS304 

Derivative  of  JD53 

NJY61-G 

MATa  his3-A200  ku2-3,n2  Iys2-801  trpl-A63  ura3^52 

Derivative  of  JD53 

NUG-SEC61::pRS304 


of  SEC62  in  Nub-Sec62(AC60)-Dha  (codon  223,  underlined)  to  the 
first  two  codons  of  DHFR  (underlined). 

N^^rBOSl  was  constructed  in  part  by  PCR  amplification,  with 
two  synthetic  oligos  and  yeast  genomic  DNA  as  a  template,  yielding 
a  258"bp  fragment  containing  the  first  229  bp  of  the  BOSl  ORF. 
Upstream  of  the  BOSl  ATG  was  a  short  linker  sequence  and  a 
BamHl  site,  to  allow  in^frame  fusion  of  the  Pcupi  promoter-N^b 
module.  The  sequence  between  N„b  and  BOSl  reads:  GGG  ATC 
CCT  CCA  GGA  ATG,  The  first  four  triplets  encode  residues  35,  36, 
37,  and  38  of  N^b/  followed  by  the  Gly  codon  and  the  start  codon  of 
BOSl.  The  BamHl  site  is  underlined.  The  3'-region  of  the  resulting 
fragment  terminated  in  a  Sail  site  for  insertion  into  the  integrating 
vectors  pRS306  or  pRS303.  The  vector  was  cut  at  the  unique  EcoRI 
site  in  the  BOS3 -containing  fragment  and  transformed  into  S.  cer- 
evisiae  strains  YPH500  and  JD53  to  produce,  through  homologous 
recombination,  the  integrated  cassette  that  expressed  N^b^Boslp 
from  the  Pcun  promoter.  The  presence  of  the  desired  gene  fusion 
and  the  absence  of  wild-type  BOSl  were  verified  by  PCR. 

An  integrated  copy  of  Pcupi-Nub'S£C62  was  produced  by  ampli¬ 
fying  the  first  438  bp  of  the  SEC62  ORF,  and  then  cloning  it,  using 
the  BamHl  and  EcaRI  restriction  sites,  in  frame  behind  the  pRS306- 
Pcupi-Nub  cassette.  A  tmique  Aflll  site  in  the  SEC62  ORF  was  used 
to  linearize  the  plasmid  for  transformation  and  integration  at  the  S. 
cerevisiae  SEC62  gene,  yielding  the  strain  NJY62-L  The  N-terminal 
147-residue  fragment  of  Sec62p  that  was  coexpressed  with  N^b- 
Sec62p  in  the  resulting  strain  has  previously  been  shown  to  be 
inactive  in  translocation  (Deshaies  and  Schekman,  1990).  Nub-SEC6I 
was  constructed  by  targeted  integration  of  a  Nyb-SEC61- containing 
fragment  into  SEC61  of  the  S.  cerevisiae  strain  JD53  (Table  1).  Spe¬ 
cifically,  a  fragment  containing  the  first  875  bp  of  the  SEC61  ORF 
was  amplified  by  PCR  and  inserted  downstream  of  the  pRS304-  or 
‘pRS303-based  PcuprNub  cassette,  using  the  flanking  BamHl  and 
EcoRI  sites.  The  linker  sequence  between  N^b  and  SEC61  was  GGG 
ATC  CCT  GGG  TCT  GGG  ATG.  Underlined  are  the  BamHl  site  and 
the  first  codon  of  SEC61.  For  targeted  integration,  the  plasmid  was 
linearized  at  the  unique  Stul  site  in  the  SEC61  ORF  to  create  the 
yeast  NJY61-I.  A  detailed  description  of  the  NJY61  strains  (Table  1) 
will  be  presented  elsewhere  (Wittke  and  Johnsson,  unpublished 
data). 

All  of  the  Pcupi  promoter-controlled  ORFs  were  expressed  under 
noninducing  conditions  (no  copper  added  to  the  medium),  except  in 
the  experiment  shown  in  Figure  5B,  where  cells  were  incubated  in 
the  presence  of  0.1  mM  CUSO4. 


Immunoblotting 

Proteins  fractionated  by  SDS-12.5%  PAGE  were  electroblotted  onto 
nitrocellulose  (Schleicher  &  Schuell,  Dassel,  Germany)  or  polyvi- 
nylidene  difluoride  (Machery-Nagel,  Diiren,  Germany)  membranes, 
using  the  semidry  transfer  system  (Hoeffer  Pharmacia  Biotech,  San 
Francisco,  CA).  Blots  were  incubated  with  an  anti-ha  monoclonal 
antibody  (Babco,  Richmond,  CA)  and  visualized  using  horseradish 
peroxidase-coupled  goat  anti-mouse  antibody  (Bio-Rad,  Hercules, 
CA),  the  chemiluminescence  detection  system  (Boehringer,  Mann¬ 
heim,  Germany),  and  x-ray  films.  Where  indicated,  quantification 
was  performed  using  the  Lumi  Imager  system  (Boehringer). 

Pulse-Chase  Analysis 

Yeast-rich  (YPD)  and  synthetic  minimal  media  with  2%  dextrose 
(SD)  were  prepared  as  described  previously  (Dohmen  et  al.,  1995).  S. 
cerevisiae  cells  expressing  the  N^b  and  C^b  fusions  were  grown  at 
30°C  in  10  ml  of  SD  medium  without  externally  added  copper  to  an 
ODgQo  of  ~1  and  labeled  for  5  min  with  Redivue  Promix-p^S] 
(Amersham,  Buckinghamshire,  United  Kingdom),  followed  (either 
directly  or  after  a  chase)  by  immunoprecipitation  with  the  anti-ha 
monoclonal  antibody,  essentially  as  described  by  Johnsson  and 
Varshavsky  (1994a,b).  The  EndoH  analysis  of  glycosylated  proteins 
was  carried  out  as  described  by  Orlean  et  al  (1991).  Samples  were 
concentrated  before  SDS-12.5%  PAGE  by  precipitation  with  chloro¬ 
form/methanol.  Gels  were  fixed  and  enhanced  for  fluorography. 
For  quantitative  analysis,  a  dried  gel  was  exposed  and  scanned 
using  a  Phosphorlmager  (Molecular  Dynamics,  Sunnyvale,  CA). 


RESULTS 

Experimental  Strategy 

The  use  of  split  Ub  to  monitor  the  proximity  between 
the  proteins  X  and  Y  requires  the  construction  of  two 
"complementary"  fusion  proteins.  One  fusion  bears 
N„b  (see  INTRODUCTION)  linked  to  X  and 

the  other  bears  C^b  linked  to  both  Y  and  a  reporter 
protein  R  at  the  C  terminus  of  C^b  ^(Y-Cub-R)*  The 
liberation  of  the  reporter  through  the^Ub-dependent 
cleavage  by  UBPs  indicates  the  in  vivo  reconstitution 
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of  a  quasi-native  Ub  from  N^b  and  C^b*  In  the  split-Ub 
assay,  the  efficiency  of  cleavage  at  the  C  terminus  of 
Cub  Y-Cyb-R  is  measured  relative  to  the  efficiency  of 
cleavage  obserx^ed  with  selected  reference  (control) 
proteins  (Figure  1). 

To  monitor  protein  interactions  during  translocation 
of  a  protein  across  the  ER  membrane,  N^b  was  fused  to 
the  N  terminus  of  a  membrane  protein  that  is  a  part  of 
the  translocation  machinery  (Figure  1).  Owing  to  the 
constraint  of  the  assay,  which  requires  the  cytosolic 
location  of  the  reconstituted  Ub,  the  N  terminus  of  this 
membrane  protein  must  be  located  in  the  cell's  cy¬ 
tosol.  Sec62p  has  an  N-terminal  cytosolic  domain  of 
158  residues,  which  is  followed  by  two  membrane- 
spanning  segments  and  a  C-terminal  segment  also 
facing  the  cytosol  (Deshaies  and  Schekman,  1990).  N^b 
was  therefore  fused  to  the  N  terminus  of  Sec62p, 
yielding  Nub-Sec62p.  C^b  was  sandwiched  between 
the  56  N-terminal  residues  of  the  precursor  of  S.  cer- 
evisiae  a-factor  pheromone  (prepro-a-factor)  and  the 
ha  epitope-tagged  mouse  dihydrofolate  reductase 
(DHFR-ha;  denoted  as  Dha)  as  a  reporter  protein, 
yielding  Mfa37-Cub"Dha  (Figure  2).  The  cleavage  of  the 
Cub^containing  fusion  at  the  Cub^Dha  junction  was 
detected  with  a  monoclonal  anti-ha  antibody. 


Split-Ub  Detects  a  Proximity  between  a 
Translocating  Protein  and  Sec62p 

We  first  verified  that  Mfa37-Cub"Dha  could  be  translo¬ 
cated  across  the  ER  membrane  and  that  the  N-terminal 
extension  of  Sec62p  with  N^b  did  not  interfere  with 
the  Sec62p  function  in  translocation.  After  a  5-min 
pulse  of  wild-type  S.  cerevisiae  with  ^^S-methionine, 
the  labeled  MfQ:37-Cub“E)ha  was  immunoprecipitated 
as  a  glycosylated  and  undipped  fusion  (Figure  3A). 
Thus,  Mfa37-Cub"Dh3  could  indeed  be  translocated 
into  the  lumen  of  ER.  Introduction  of  the  same  Mfa37- 
Cub-Dha  construct  into  the  yeast  strain  RSY529,  which 
carries  a  temperature-sensitive  (ts)  variant  of  Sec62p 
(Rothblatt  et  al,  1989),  confirmed  the  severe  transloca¬ 
tion  defect  of  this  strain.  About  50%  of  the  pulse- 
labeled  Mfa37-Cub*Dha  entered  the  lumen  of  the  ER  in 
this  strain  at  the  semipermissive  temperature  of  30°C, 
while  the  rest  remained  in  the  cytosol  (Figure  3A). 
Thus,  the  translocation  of  Mfa37-Cub-Dha  depends  on 
Sec62p.  This  made  it  possible  to  determine  whether 
Nub“&c62p  is  functionally  active.  The  test  utilized 
Nug-Sec62p,  in  which  the  N-terminal  half  of  Ub  con¬ 
tained  Gly-13  instead  of  wild-type  Ile-13.  This  deriv¬ 
ative,  denoted  as  N^g,  has  a  lower  affinity  for  C^b  than 
the  wild-type  N^b  Qohnsson  and  Varshavsky,  1994a). 
We  chose  Nug-Sec62p  for  this  experiment  to  minimize 
the  reconstitution  of  the  Ub  moiety  through  interac¬ 
tions  between  Nub-Sec62p  and  potentially  arrested 
molecules  of  Mfa37-Cub"Dha,  which  might  be  localized 
in  the  cytosol.  Plasmids  expressing  Nug-Sec62p  and 
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Nub-Sec62p 

Sec62(AC60)Dha 

Nub-Sec62(AC60)Dha 


Figure  3.  Sec62p  is  close  to  the  signal  sequence  of  the  a-factor 
precursor.  (A)  S.  cerevisiae  cells  expressing  Mta^Cub-Dha  (construct 
8;  Figure  2)  were  labeled  for  5  min  with  -^^S-methionine.  The  ex¬ 
tracted  proteins  were  immunoprecipitated  with  anti-ha  antibody, 
followed  by  a  mock  treatment  (lanes  a,  c,  e,  and  g)  or  the  treatment 
with  EndoH  (lanes  b,  d,  f,  and  h),  and  SDS-PAGE.  The  results  with 
cells  coexpressing  N^g-  or  Nui,-Sec62p  are  shown  in  lanes  c  and  d 
and  g  and  h,  respectively.  The  analysis  Was  performed  with  N^g- 
Sec62p  in  the  S.  cerevisiae  mutant  RSY529  carrying  a  ts  allele  of 
SEC62  (lanes  a-d)  or  with  Nub-Sec62p  in  the  wild-type  yeast  (lanes 
e-h).  Number  8  (following  the  numbering  of  the  constructs  in 
Figure  2)  on  the  right  indicates  the  positions  of  uncleaved  Mfa37- 
Cub-Dba  and  its  glycosylated  forms.  An  asterisk  denotes  an  unre¬ 
lated  yeast  protein  that  cross-reacts  with  the  anti-ha  antibody.  (B) 
Nub-Sec62p  encodes  a  functionally  active  protein.  RSY529  cells  car¬ 
rying  an  empty  plasmid  (a),  Sec62p  (b),  Nub-Sec62p  (c), 
Sec62(AC60)Dha  (d),  or  Nub-Sec62(AC60)Dha  (e)  were  spotted  on 
minimal  media  and  grown  for  2  d  at  30°C  (semipermissive  temper¬ 
ature  for  unmodified  RSY529). 


Mfa37-Cub“Dha  were  cotransformed  into  RSY529  cells 
and  assayed  at  30°C.  As  in  wild-type  cells,  only  trans¬ 
located  Mfa37-Cub‘Dha/  but  virtually  no  free  Dha  or 
nontranslocated  MfagT-Cub-Dba,  was  detected  after 
immunoprecipitation  and  EndoH  treatment  of  the 
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cells  that  had  been  labeled  for  5  min  with  '^'’S-methi- 
onine  (Figure  3A). 

To  test  Nyt,-Sec62p  directly  is  the  wild-type  half 
of  Ub,  containing  lie  at  position  13),  we  examined  its 
ability  to  complement  the  growth  defect  of  RSY529 
cells.  RSY529  cells  expressing  Nut,-Sec62p  were  found 
to  grow  at  the  semipermissive  temperature  of  30°C,  in 
contrast  to  congenic  cells  carrying  a  control  plasmid 
(Figure  3B).  To  verify  that  the  suppression  of  the  ts 
phenotype  was  not  due  to  the  initiation  of  translation 
from  the  first  (internal)  ATG  codon  of  Sec62p  within 
the  Nub-Sec62p  fusion,  the  rescue  experiment  was  suc¬ 
cessfully  repeated  with  the  otherwise  identical  deriv¬ 
ative  of  Nub-Sec62p  that  lacked  the  first  ATG  of  SEC62 
(our  unpublished  results). 

A  significant  amount  of  free  Dha  was  generated 
when  Mfa37-Cub‘Dha  was  expressed  (in  either  wild- 
type  or  RSY529  cells)  together  with  Nub-Sec62p,  which 
contained  the  wild-type  half  of  Ub  (Figure  3A  and  our 
unpublished  results).  We  concluded  that  Sec62p  is 
close  to  the  nascent  polypeptide  chain  during  its  trans¬ 
location  into  the  ER.  The  cleavage  at  the  C  terminus  of 
Cub  requires  its  interaction  with  N^b  and  depends  on 
the  presence  of  UBPs  (Johnsson  and  Varshavsky, 
1994a).  Since  UBPs  have  previously  been  shown  to  be 
absent  from  the  ER  (Johnsson  and  Varshavsky,  1994b), 
the  free  Dha  moiety  had  to  be  produced  in  the  cytosol. 
Fractionation  experiments  confirmed  that  free  Dha 
was  absent  from  membrane-enclosed  compartments 
in  whole-cell  extracts  (our  unpublished  results).  An 
entirely  independent  evidence  for  this  conclusion  was 
produced  by  replacing  Dha  in  Mfa37-Cub“Dha  with 
Ura3p  as  the  reporter  moiety.  Ura3p  confers  the  Ura"^ 
phenotype  on  tiraSA  cells  only  if  Ura3p  has  access  to 
the  cytosol  (Johnsson  and  Varshavsky,  1994b).  In  our 
tests,  the  cytosolic  Ura3p  was  produced  only  if  Mfa37- 
Cub-Ura3p  was  coexpressed  with  Nub-Sec62p  (com¬ 
pare  A  and  B  in  Figure  7),  in  agreement  with  the  other 
evidence  (see  above)  that  the  cleavage  at  the  C^b- 
protein  junction  takes  place  exclusively  in  the  cytosol. 

The  transient  nature  of  the  proximity  between 
Sec62p  and  the  nascent  chain  of  a  translocated  protein 
was  indicated  by  the  near-absence  of  the  released  Dha 
moiety  if  Mfa37-Cub-Dha  was  coexpressed  with  either 
Nug-Sec62p  or  Nua-Sec62p  instead  of  Nub-Sec62p  (N^a 
denotes  Ala  at  position  13  of  N^b);  by  contrast,  the 
same  experiment  with  Nub-Sec62p  resulted  in  a  signif¬ 
icant  cleavage  of  Mfa37-Cub‘Dha  (Figures  3A  and  6C). 
Previous  work  (Johnsson  and  Varshavsky,  1994a)  has 
shown  that  N^a  and  N^g  can  induce  significant  Ub 
reconstitution  when  either  of  them  and  C^b  are  linked 
to  polypeptides  that  form  a  stable  (long-lived)  com¬ 
plex  in  a  cell.  In  summary,  the  observed  absence  of 
significant  Ub  reconstitution  with  N^a  and  N^g  (in 
contrast  to  N^b)  was  interpreted  to  signify  a  close  but 
transient  (short-lived)  proximity  between  Sec62p  and 
Mfa37-Cub~Dha. 


Specificity  of  the  Spatial  Proximity  betiveen  a 
Signal  Sequence-bearing  Nascent  Polypeptide  and 
Sec62p 

A  commonly  used  negative  control  in  a  translocation 
assay  is  a  protein  with  a  defective  or  absent  signal 
sequence  (Allison  and  Young,  1988;  Miisch  et  a\., 
1992).  Such  a  control  is  not  entirely  compatible  with 
spatio-temporal  aspects  of  the  split-Ub  assay.  Specifi¬ 
cally,  a  C^b-fusion  protein  lacking  a  signal  sequence 
accumulates  in  the  cytosol  (where  the  split-Ub  assay 
operates),  whereas  an  analogous  signal  sequence¬ 
bearing  protein  is  continuously  removed  from  this 
compartment.  A  direct  comparison  between  reactions 
that  involve  a  signal  sequence-bearing  polypeptide 
and  its  signal  sequence-lacking  counterpart  requires 
the  ability  to  compare  the  local  concentrations  of  the 
two  polypeptides  at  the  site  of  translocation.  We  are 
not  aware  of  an  in  vivo  technique  that  would  be 
independent  of  the  split-Ub  assay  and  at  the  same 
time  would  allow  a  measurement  of  these  parameters. 
Therefore,  we  devised  an  alternative  control.  The  ex¬ 
tent  of  cleavage  of  Mfa37-C^,b-Dha  at  the  C^b-Dha 
junction  should  reflect  the  time-averaged  spatial  prox¬ 
imity  between  the  nascent  Mfa-chain  and  a  coex¬ 
pressed  N^b'Coritaining  fusion.  By  comparing  the  ex¬ 
tent  of  cleavage  of  Mfa37-Cub*’Dha  in  the  presence  of 
N^jb’Sec62p  (Figure  3A)  with  the  analogous  activity  of 
Nub'fusion  proteins  that  are  not  involved  in  the  ER 
targeting  and  translocation,  we  could  assess  the  spec¬ 
ificity  of  the  reaction  between  Nyb"Sec62p  and  Mfa37- 
C„b-Dha. 

Four  N^b-fusion  proteins,  N^b’^oslp,  N^b-SteMp, 
Nub-Sed5p,  and  N^b-Tpilp  were  tested  in  the  split-Ub 
assay  with  Mfa37-Cub“Dha.  The  expected  intracellular 
locations  of  these  fusions,  and  their  predicted 
topologies  in  the  membrane  are  shown  in  Figure  2A 
(Shim  et  al,  1991;  Banfield  et  al,  1994;  Sapperstein  et 
ah,  1994;  Lewke  and  Johnsson,  unpublished  data).  We 
found  that,  in  contrast  to  N^_jb“Sec62p,  none  of  the  four 
tested  N^b  fusions  induced  a  significant  cleavage  of 
Mfa37-C^,b-Dha  (Figure  4A).  A  small  amount  of  free 
Dha  could  be  detected  in  the  immunoblots  when  N^b" 
Boslp  was  overexpressed.  The  lack  of  significant  Ub 
reconstitution  from  N^b  and  C^b  upon  coexpression  of 
Mfa37-Cub“Dha  and  the  N^b-niodified  ER  membrane 
proteins,  Boslp,  Stel4p  (Figure  4A),  and  Secl2p  (Na- 
kano  et  al,  1988;  our  unpublished  results),  confirmed 
that  the  steady-state  concentration  of  Mfa37-Cub"Dha 
in  the  cytosol  was  extremely  low. 

To  verify  that  the  observed  absence  of  Ub  reconsti¬ 
tution  (Figure  4A)  was  not  due  to  either  low  concen¬ 
trations  of  the  tested  fusion  proteins  or  reduced  acces¬ 
sibility  of  their  linked  N^b  moieties,  we  compared  the 
activity  of  these  N^b  fusions  toward  a  cytosolic  C„b- 
fusion  protein.  C^b-Dha  was  fused  to  the  C  terminus 
of  the  cytosolic  enzyme  triosephosphate  isomerase 
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Figure  4.  The  in  vivo  proxim¬ 
ity  between  Sec62p  and  Mfa^- 
Cui,-Dha  is  transient  and  spe¬ 
cific.  (A)  Immunoblot  analysis 
of  extracts  of  S,  ccrcvisiac  coex¬ 
pressing  the  Mfa37-C„i,-Dha 
(construct  8;  Figure  2)  and  one 
of  the  following  constructs: 

N„t,-Sec62p,  integrated  (lane  a) 
or  plasmid-borne  (lane  b);  N^^,- 
Boslp  (lane  c);  N^,i,-Stel4p  (lane 
d);  Nut,-Sed5p(Iane  e);  and  N^t,- 
Tpilp  (lane  f).  (B)  Immunoblot 
analysis  of  extracts  of  S.  cerevi- 
siae  expressing  the  Tpilp-Cyt,- 
Dha  fusion  (construct  14;  Figure 
2)  alone  (lane  a)  or  together 
with  one  of  the  following  con¬ 
structs:  Nui,-Sec62p,  either  inte¬ 
grated  (lane  b)  or  plasmid- 
borne  (lane  c);  N^i^-Boslp  (lane 
d);  N„i,-Stel4p  (lane  e);  N^^- 
Sed5p(lane  f);  and  Nu^.-Tpilp 
(lane  g).  Number  14  on  the  left 
indicates  the  position  of  un¬ 
cleaved  Tpilp-C^,t,-Dha.  (C)  S. 
ccreiusiae  cells  expressing 
Mfa37-Cub-C>ha  (construct  8; 

Figure  2)  together  with  either 
the  vector  (lane  a),  N^i^-Boslp 
(lane  b),  or  Nut,-Sec62p  (lane  c) 
were  labeled  for  5  min  with 
methionine.  The  extracted  pro¬ 
teins  were  immunoprecipitated 
with  anti-ha  antibody  and  ana¬ 
lyzed  by  SDS-PAGE.  (D)  Quan¬ 
titation  of  the  pulse-labeling  ex¬ 
periment  (C)  using  Phosphor- 
Imager.  The  extent  of  Dha  re¬ 
lease  in  the  presence  of  N^i,- 
Sec62p  was  arbitrarily  set  at 
100.  The  averages  of  three  ex¬ 
periments  are  shown.  Lanes  a, 
b,  and  c  are  the  same  as  in  panel 

C.  (E)  S.  cerevisine  cells  expressing  Mfa^-Cub-Dha  together  with  Nyt,'Sec62p  were  labeled  for  5  min  with  ^'^S-methionine  and  chased  for 
5  and  15  min,  followed  by  extraction  of  proteins,  immunoprecipitation  with  anti-ha  antibody,  and  SDS-PAGE. 


(Tpilp),  yielding  Tpil-Cyb-Dha  (Figure  2B).  All  of  the 
N^b'fusion  proteins  in  Figure  2A  induced  a  significant 
release  of  Dha  from  the  test  protein  Tpil-Cyb-Dha 
(Figure  4B).  This  analysis  also  suggested  that 
Boslp  was  expressed  to  higher  levels  than  other  N^b 
fusions. 

To  quantify  the  relative  proximities  of  N^^b-Boslp 
and  Nub-Sec62p  to  Mfa37“Cub"Dha,  yeast  cells  were 
labeled  for  5  min  with  ^^S-methionine,  and  the  re¬ 
leased  Dha  was  determined  as  described  in  the  legend 
to  Figure  4.  Coexpression  of  Nub-Sec62p  and  Mfa37- 
Cub'Dha  yielded  ^15  times  more  of  the  free  Dha  than 
coexpression  of  N^b-Boslp  and  MfQ:37-Cub"Dha  (Fig¬ 
ure  4,  C  and  D).  Assuming  that  the  N^b  moieties  in 
Nub-Sec62p  and  N^b-Boslp  were  equally  accessible  to 
the  cytosol  (Figure  4B),  we  concluded  that  the  time- 
averaged  proximity  between  the  nascent  chain  of 


MfQ:37-Cub“Dha  and  the  N^b'^earing  transmembrane 
proteins  was  much  higher  for  Sec62p  than  for  the  ER 
membrane  proteins  that  are  not  involved  in  targeting 
or  translocation.  Note  that  this  analysis  may  actually 
underestimate  the  proximity  of  Sec62p  to  the  nascent 
chain,  because  we  invariably  observed  a  more  efficient 
cleavage  of  Mfa37-Cub-Dha  when  Nub-Sec62p  was  the 
only  form  of  Sec62p  in  the  cell  (Figures  4A  and  6A). 
Therefore  we  interpret  the  reduced  cleavage  of  Mfa37- 
Cub-Dha  in  the  presence  of  both  Nub-Sec62p  and  the 
native  Sec62p  as  the  consequence  of  competition  be¬ 
tween  those  two  Sec62p-containing  species  for  either 
the  signal  sequences  of  translocated  proteins  or  the 
ligands  of  Sec62p  in  the  complex  of  Sec  proteins. 

Recent  evidence  indicates  that  misfolded  or  other¬ 
wise  abnormal  proteins  in  the  lumen  of  the  ER  can  be 
retrotransported  across  the  ER  membrane  back  into 
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Figure  5.  The  nature  of  the  sig¬ 
nal  sequence  and  its  distance 
from  Ci,t,  determine  the  extent  of 
cleavage  of  in  the  presence 
of  Nui,-Sec62p.  (A)  S.  cerevisiae  ex¬ 
pressing  Mfa3^ut,-Dha  (con¬ 
struct  8;  Figure  2)  (lanes  a  and  b), 
Mfc^3-Cui,-Dha  (construct  9) 
(lanes  c  and  d),  Suc223-Ci,t,-Dha 
(construct  10)  (lanes  e  and  f), 
Suc233-Cub'Dl^  (construct  11) 
(lanes  g  and  h),  Suc259-C„b-Dha 
(construct  12)  (lanes  i  and  j)  and 
Suc25,p-Cub’Dha  (construct  13) 
(lanes  k  and  1)  were  labeled  with 
■^'^methionine  for  5  min.  The  ex¬ 
tracted  proteins  were  either 
mock-treated  (lanes  a,  c,  e,  g,  i, 
and  k)  or  treated  with  EndoH 
(lanes  b,  d,  f,  h,  and  1),  followed 
by  immunoprecipitation  with  an¬ 
ti-ha  antibody  and  SDS-PAGE. 

(B)  Same  as  panel  A  but  the  cells 
also  contain^  N„b’Sec62p  in  ad¬ 
dition  to  the  Cub-fusions  Mfa^T- 
C^b-Dha,  Mfa^^s-C„b-Dha,  Suc223- 
Cub-Dha,  Suc233-Cub-Dha,  Suc25o- 
Cub-Dha,  Suc2s,H-Cub-Dha  (lanes 
a-f).  The  analysis  was  carried  out 
by  immunoblotting  whole-  cell 
extracts  with  the  anti-ha  antibody. 

(C)  S.  cerevisiae  cells  expressing 
Suc223'Cub-Dha  (construct  10; 
Figure  2)  (lanes  a-c)  and  Suc25c,- 
Cub'Dha  (construct  12;  Figure  2) 
(lanes  d-f)  together  with  either 
Nub-Sec62p  (lanes  b  and  e),  N^^b- 
Boslp  (lanes  c  and  f)  or  the  vector 
(lanes  a  and  d)  were  labeled  for  5 
min  with  ^‘S-methionine.  Whole¬ 
cell  extracts  were  immunoprecipi- 
tated  with  anti-ha  antibody,  fol¬ 
lowed  by  SDS-PAGE  and 
autoradiography.  (D)  S.  cerevisiae 
cells  expressing  ASuc223-Cub-Dha 
(construct  15;  Figure  2)  or 
ASuc259-Cub-'Dha  (construct  16; 
Figure  2)  together  with  either  the 
vector  (first  six  lanes)  or  N^b- 
Sec62p  (last  sb<  lanes)  were  la¬ 
beled  for  5  min  with  ^^methio¬ 
nine  and  chased  for  10  and  30 
min,  followed  by  extraction,  im- 
munopredpitation  with  anti-ha 
antibody,  and  SDS-PAGE.  Num¬ 
bers  15  and  16  indicate  the  posi¬ 
tions  of  the  corresponding  (un¬ 
cleaved)  Cub  fusions. 


the  cytosol,  where  they  are  degraded  by  the  Ub  system 
(Biederer  et  al,  1996;  Hiller  et  al,  1996;  Wiertz  et  al, 
1996).  This  retrotransport  involves  at  least  some  of  the 
components  of  the  known  ER  translocation  machinery 
(Plemper  et  al,  1997).  To  determine  whether  the  cleav¬ 
age  of  Mfa37-Cub-Dha  at  the  Cut,-Dha  junction  occurs 


during  translocation  into  the  ER  or  during  (in  this 
case)  a  hypothetical  retrotransport  from  the  ER,  cells 
coexpressing  Nub-Sec62p  and  lClfa37-Cub-Dha  were  la¬ 
beled  for  5  min  with  ^^S-methionine,  and  then  chased 
for  15  min  (Figure  4E).  Although  the  translocated 
Mfa37-Cub-Dha  disappeared  rapidly  during  the  chase. 
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the  amount  of  free  Dha  that  accumulated  during  the 
pulse  remained  constant. 

We  conclude  that  the  in  vivo  proximity  between 
Sec62p  and  Mfa37-Cub-Dha  that  is  detected  by  the 
split-Ub  assay  occurs  either  during  or  very  shortly 
after  the  sjmthesis  of  MfQ:37-Cub-Dha.  The  apparent 
disappearance  of  the  pulse-labeled,  translocated 
Mfa37-Cub-Dha  during  the  chase  accounts  for  the  dif¬ 
ficulty  in  detecting  this  species  by  a  steady-state  assay 
such  as  immunoblotting  (Figures  4A,  5B,  and  6A).  The 
likely  cause  of  the  disappearance  of  translocated 
Mfa37-Cub-Dha  is  its  molecular  mass  heterogeneity, 
owing  to  its  glycosylation,  which  results  in  a  smear 
upon  SDS-PAGE  (Figures  3A,  4C,  and  4E). 

The  Efficiency  of  Ub  Reconstitution  Mediated  by 
N„ij~Sec62p  Depends  on  Both  the  Identity  of  a  Signal 
Sequence  and  the  Position  of  ^ub  in  the  Nascent 
Polypeptide  Chain 

The  proximity  of  Sec62p  to  the  signal  sequence  of 
Mfa37-Cub-Dha  is  detected,  in  the  split-Ub  assay, 
through  the  ability  of  Nub*Sec62p  to  induce  the  cleav¬ 
age  of  Mfa37-Cub'Dha  at  the  Cub-Dha  junction  (Figure 
3A).  If  this  cleavage  reflects  the  physical  proximity 
between  Sec62p  and  a  signal  sequence,  the  efficiency 
of  cleavage  should  decrease  if  the  Qb  moiety  is  moved 
closer  to  the  C  terminus  of  the  nascent  polypeptide 
chain.  However,  this  purely  spatial  consideration  ne¬ 
glects  the  temporal  aspect  of  the  translocation  process 
(Walter  and  Johnson,  1994).  The  targeting  and  the 
actual  translocation  are  initiated  during  or  shortly  af¬ 
ter  the  synthesis  of  a  signal  sequence-bearing  protein. 
Consequently,  the  C-terminal  parts  of  the  nascent 
chain  may  still  be  synthesized,  or  at  least  associated 
with  the  ribosome,  at  the  time  when  Sec62p  and  the 
signal  sequence  have  already  become  spatially  close. 
Extending  the  spacer  would  increase  the  distance  be¬ 
tween  Cub  3nd  the  signal  sequence  of  Mfa37-Cub-Dha. 
This  would  be  expected  to  decrease  the  time  window 
available  for  the  interaction  between  the  Cub  moiety  of 
Mfa37-Cub*Dha  and  the  Nub  moiety  of  Nub-Sec62p. 
Therefore,  a  test  of  this  kind  cannot  deconvolute  the 
contribution  of  each  of  the  two  parameters  (increased 
spatial  distance  along  the  chain  between  Sec62p  and 
Cub  srid  decreased  time  window  for  the  Nub-Cub  in¬ 
teraction)  to  the  overall  effect  of  extending  the  length 
of  the  polypeptide  between  the  signal  sequence  and 
Cub-  These  constraints  notwithstanding,  moving  the 
Cub  moiety  of  Mfa37-Cub-Dha  further  away  from  its 
signal  sequence  makes  it  possible  to  gauge  the  acces¬ 
sibility  of  Sec62p  to  specific  regions  of  the  nascent 
polypeptide  chain  in  vivo. 

In  the  actual  experiment,  the  distance  between  the 
signal  sequence  of  MfaaT-Cub-Dha  and  its  Cub  moiety 
was  increased  from  37  to  65  residues  (Mfogg-Cub-Dha; 
Figure  2B,  construct  9).  The  results  of  EndoH  treat¬ 


ment  of  Mfttbp-Cub-Dha  immunoprecipitated  from 
pulse-labeled  wild-type  cells  confirmed  that  Mfa55- 
Cub-Dha  was  efficiently  translocated  into  the  ER  (Fig¬ 
ure  5A).  However,  the  efficiency  of  the  Dha-yielding 
cleavage  of  Mfo^s-Cub-Dha  upon  coexpression  of  Nub- 
Sec62p  was  clearly  reduced  in  comparison  to  the  same 
cleavage  with  Mfa37-Cub-Dha  and  Nub-Sec62p  (Figure 
5B). 

Both  the  kinetics  and  the  mode  of  targeting  for 
translocation  are  influenced  by  the  identity  of  a  signal 
sequence  (Bird  et  ai,  1987;  Johnsson  and  Varshavsky, 
1994b;  Ng  et  ai,  1996).  For  example,  the  efficient  trans¬ 
location  of  invertase  (Suc2p)  requires  the  SRP,  in  con¬ 
trast  to  a  much  weaker  requirement  for  SRP  in  the  case 
of  the  prepro-a-factor's  signal  sequence  (Harm  and 
Walter,  1991;  Ogg  et  ai,  1992;  Johnsson  and  Var¬ 
shavsky,  1994b).  Consequently,  the  coupling  between 
translation  and  translocation  is  tighter  for  proteins 
bearing  the  invertase  signal  sequence  than  for  proteins 
carrying  the  signal  sequence  of  the  a-factor. 

We  assessed  the  in  vivo  proximity  of  the  invertase 
signal  sequence  to  Sec62p  by  measuring  the  reconsti¬ 
tution  of  Ub  from  Nub-Sec62p  and  Suc2-Cub-Dha, 
where  the  Suc2p  moiety  was  linked  to  Cub  through  a 
spacer  of  increasing  length  (Figure  2B).  The  expression 
and  efficient  translocation  of  different  Suc2-Cub-Dha 
constructs  were  assayed  by  immunoprecipitation  and 
subsequent  EndoH  treatment  (Figure  5A).  The  prox¬ 
imity  of  Cub  in  Suc2-Cub-Dha  to  Nub  of  Nub-Sec62p 
was  assayed  by  immunoblot  detection  of  the  cleavage- 
derived  free  Dha  in  whole-cell  extracts.  The  pattern 
already  observed  for  the  Mfa-Cub'Dha  constructs  re¬ 
curred  with  the  constructs  bearing  the  invertase  signal 
sequence  (Figure  5B).  Moreover,  coexpression  of  Nub- 
Sec62p  with  either  Suc223-Cub-Dha  or  Suc233-Cub-Dha 
yielded  lower  amounts  of  free  Dha  than  the  analogous 
assays  with  Nub-Sec62p  and  Mfa37-Cub"Dha,  which 
bears  a  spacer  of  comparable  length  (Figure  5B). 

Pulse-chase  analyses  with  cells  expressing  Nub* 
Sec62p  (or  Nyb-Boslp)  and  either  Suc223-Cub-Dha  or 
Suc259-Cub-Dha  confirmed  the  immunoblot  data.  Spe¬ 
cifically,  a  significant  release  of  free  Dha  was  observed 
only  for  the  pair  of  Nub-Sec62p  and  Suc223-Cub-Dha 
(Figure  5C).  Our  previous  work  has  shovm  that  the 
segment  of  the  nascent  polypeptide  chain  where  the 
Cub  moiety  was  inserted  in  either  the  Suc223-Cub-Dha 
or  the  Mfa!37-Cub-Dha  fusion  is  transiently  exposed  to 
the  cytosol — until  the  initiation  of  ER  translocation 
Gohnsson  and  Varshavsky,  1994b).  Therefore,  we 
compared  the  ratios  of  cleaved  to  uncleaved  Suc223- 
Cub-Dha  and  Mfa37-Cub-Dha.  Cells  expressing  Nub- 
Sec62p  and  either  the  Cub  fusion  8  or  10  (Figure  2B) 
were  labeled  for  5  min  with  ^®S-methionine  and  pro¬ 
cessed  for  immxmoprecipitation  with  anti-ha  antibody, 
foDowed  by  determination  of  the  deaved-to-imcleaved 
ratio  (Figure  6D).  This  ratio,  a  measure  of  the  time- 
averaged  proximity  of  Sec62p  to  a  translocating  protein. 
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Figure  6.  Secblp,  but  not  a  mu¬ 
tant  of  Sec62p,  are  elefee  to  the  nas¬ 
cent  chain  of  a  translocated  pro¬ 
tein.  (A)  These  assays  employ^  S. 
cercvisiac  expressing  N^a^T-Cu^,- 
Dha  (construct  8,  Figure  2)  and 
one  of  the  following  N^b  fusions 
(Figure  2):  Nub-Sec62p,  either  inte¬ 
grated  (lane  a)  or  plasmid  borne 
(lane  b);  N^b-Sec62(AC60)Dha 
(lane  c);  and  N^b-Secblp  flane  d). 
Whole-cell  extracts  from  these 
strains  were  subjected  to  immuno- 
blot  analysis  with  anti-ha  anti¬ 
body.  (B)  Same  as  panel  A  but  the 
same  N^b  fusions  were  coex¬ 
pressed  with  Tpil-Cub'Dha  (con¬ 
struct  14;  Figure  2).  Numbers  2 
and  14  indicate  the  positions  of  the 
corresponding  (uncleaved)  fu¬ 
sions.  (C)  Lane  a:  S.  cercvisiac  ex¬ 
pressing  Suc223-Cub-Dha  (con¬ 
struct  10;  Figure  2)  together  with 
either  N^b-Secblp,  N^^^-Secblp, 
Nu^-Sec61p,  N^b-Sec62p,  N^.,- 
Sec62p,  or  N„^-Sec62p;  lane  b: 
same  as  lane  a  but  cells  expressed 
Mfa37-C„b-Dha  (construct  8;  Fig¬ 
ure  2)  instead  of  Suc223-Cub“Dha. 
(D)  S.  cerevisiae  cells  expressing 
Mfa37-Cub“Dha  together  with 
Nub-Sec61p  (a  and  b)  or  N^b- 
Sec62p  (e  and  f),  and  cells  express¬ 
ing  Suc223-Cub'I^f‘a  together  with 
Nub-Sec61p  (c  and  d)  or  N^b" 
Sec62p  (g  and  h)  were  labeled  for  5 
min  with  •^'S-methionine.  Whole¬ 
cell  extracts  were  immunopredpi- 
tated  with  anti-ha  antibody,  fol¬ 
lowed  by  SDS-PAGE,  and 
quantitation  of  the  cleaved  and 
uncleaved  C^^b  fusions  using  Phos- 
phorlmager.  Shown  are  the  rela¬ 
tive  amounts  of  the  cleaved  (white 
bars:  a,  c,  e,  and  g)  and  uncleaved 


(black  bars:  b,  d,  f,  and  h)  C„i,  fusions.  The  sum  of  a  cleaved  and  undeaved  fusion  was  set  at  100  in  each  of  the  tinree  indepmdent  experiments.  SDs  are 
also  indicated. 


was  —eightfold  higher  for  a  nascent  polypeptide  bearing 
the  signal  sequence  of  a-factor  than  for  a  nascent 
polypeptide  bearing  the  invertase  signal  sequence  (Fig¬ 
ure  6D;  compare  Figures  4C  and  5C). 

Spacer  sequences  of  different  length  or  composition 
upstream  of  the  C^b  moiety  might  nonspecifically  in¬ 
fluence  the  interaction  between  N^b  and  Cyb-  To  assess 
this  potential  spacer  effect,  we  constructed  signal  se¬ 
quence-lacking  versions  of  Suc223-Cub-Dha  and 
Suc259-Cub-Dha  (Figure  2B,  Cub  fusions  15  and  16),  and 
compared  their  ability  to  reconstitute  Ub  in  the  pres¬ 
ence  of  coexpressed  Nub-Sec62p.  Both  of  these  Cub 
fusions  were  cleaved  at  the  Cub-Dha  jimction  at  approx¬ 
imately  the  same  rate  in  the  presence  of  Nub-Sec62p 
(Figure  5D),  in  contrast  to  the  marked  difference  in  the 
rate  of  cleavage  observed  for  their  signal  sequence-bear¬ 
ing  counterparts  (Figure  5,  B  and  C).  This  control  exper¬ 


iment  further  emphasized  the  effect  of  distance  between 
a  signal  sequence  and  the  Cyb  moiety  on  the  efficiency  of 
Ub  reconstitution  in  the  presence  of  Nub-Sec62p.  We 
conclude  that  the  accessibility  of  Sec62p  in  vivo  to  a 
specific  region  of  the  nascent  polypeptide  chain  is  influ¬ 
enced  by  both  the  nature  of  a  signal  sequence  and  its 
distance  from  that  region. 

SecSlp  Is  Equidistant  front  Two  Different  Signal 
Sequences 

A  direct  comparison  between  two  different  signal  se¬ 
quences  upstream  of  the  Cub  moiety  presumes  ap¬ 
proximately  equal  residence  times  of  the  correspond¬ 
ing  Cub  moieties  in  the  cytosol.  It  is  also  essential  to 
know  that  the  influence  of  the  identity  of  a  signal 
sequence  on  the  rate  of  Ub  reconstitution  is  not  due  to 
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a  nonspecific  intramolecular  interaction.  One  way  to 
address  these  issues  involves  measuring  the  reconsti¬ 
tution  of  Ub  from  the  C^t,  moieties  of  the  fusions  8  and 
10  (Figure  2B)  and  a  N^b-containing  fusion  that  is  not 
involved  in  translocation.  As  illustrated  in  Figure  4, 
this  test  is  not  feasible  because  of  the  rapid  transloca¬ 
tion  of  Cut,-containing  constructs  into  the  ER.  Note, 
however,  that  since  Sec61p  is  the  central  component  of 
the  translocation  pore,  proteins  that  utilize  different 
targeting  pathways  will  converge  at  Sec61p  shortly 
before  their  translocation  Gungnickel  and  Rapoport, 

1995) .  Taking  advantage  of  this  property  of  Sec61p,  we 
assayed  the  proximity  of  Mfa37-Cui,-Dha  and  Suc223- 
Cub'Dha  to  Nub‘Sec61p.  If  Mfa37-Cub-Dha  and  Suc223- 
Cub'Dha  are  cleaved  at  the  Cyb-Dha  junction  equally 
well  in  the  presence  of  N^b-Secblp,  the  above  inter¬ 
pretation  of  the  observed  selectivity  of  Nub-Sec62p 
toward  Mfa37-Cui,-Dha  (Figures  5  and  6)  would  be 
confirmed. 

To  ca^  out  this  test,  N„b  was  fused  to  the  cytosolic 
N  terminus  of  Sec61p  (Figure  2A)  (Wilkinson  et  al, 

1996) .  Nub-Sec61p  is  functionally  active  (Wittke  and 
Johnsson,  unpublished  data).  It  induced  the  release  of 
free  Dha  from  of  Mfa37-Cub-Dha  and  Tpilp-Cub-Dha 
with  efficiency  similar  to  that  of  Nub-Sec62p  (Figure  6, 
A  and  B).  Thus,  the  split-Ub  assay  independently  con¬ 
firmed  that  Sec61p  is  close  to  the  nascent  polypeptide 
chain  during  its  translocation.  To  compare  the  in  vivo 
interactions  of  Sec61p  with  the  C^b  fusions  8  and  10, 
which  bore  different  signal  sequences  (Figure  2B),  the 
amount  of  free  Dha  was  determined  by  immunoblot- 
ting  of  whole-cell  extracts.  It  was  found  that  in  the 
presence  of  Nub-Sec61p,  similar  amounts  of  Dha  were 
released  from  Mfa37-C^,b-Dha  and  Suc22,-C„b-Dha, 
whereas  in  the  presence  of  Nub-Sec62p  twice  as  much 
Dha  was  released  from  Mfa37-C„b-Dha  than  from 
Suc223-Cub-Dha  (Figure  6C). 

This  result  was  confirmed  and  extended  by  label¬ 
ing  the  cotransformed  cells  for  5  min  with  ^^g-me- 
thionine  and  quantifying  the  ratio  of  cleaved-to- 
uncleaved  C„b  fusions  (Figure  6D).  As  was  already 
observed  by  the  immunoblot  analysis,  the  above 
ratio  was  ~1  for  both  Mfa37-C„b-Dha  and  Suc223- 
Cub-Dha  in  the  presence  of  Nub-Sec61p,  but  ~2  for 
Mfa37-Cub-Dha,  and  -0.25  for  Suc223-Cub-Dha  in  the 
presence  of  N„b-Sec62p  (Figure  6Dj.  The  difference 
revealed  by  the  pulse-immunopredpitation  analysis 
is  higher  than  the  estimate  obtained  by  the  immu¬ 
noblot  analysis,  most  likely  because  of  the  continu¬ 
ous  accumulation  of  cleaved  (and  long-lived)  Dha 
before  the  processing  of  cells  for  immunoblotting. 

A  C-terminally  Truncated  Sec62p  Is  No  Longer 
Proximal  to  the  Signal  Sequence 

Does  the  proximity  of  Sec62p  to  a  nascent  polypeptide 
chain  that  is  detected  by  the  split-Ub  assay  reflect  the 


physical  binding  of  the  signal  sequence  to  this  pro¬ 
tein?  We  constructed  a  derivative  of  Nui,-Sec62p  in 
which  the  C-terminal  60  residues  of  Sec62p  were  re¬ 
placed  by  the  DHFR-ha  (Dha)  moiety,  yielding 
Sec62(AC60)-Dha.  A  similar  Sec62p-invertase  fusion 
was  described  by  Deshaies  and  Schekman  (1990)  and 
shown  to  be  nonfunctional.  As  expected,  neither 
Sec62(AC60)-Dha  nor  N„b-Sec62(AC60)-Dha  comple¬ 
mented  the  ts  phenotype  of  RSY529  cells  (Figure  3B). 

The  Ub-reconstitution  activity  of  N„b-Sec62(AC60)- 
Dha  in  the  presence  of  either  Mfa37-C^b-Dha  or  Tpil- 
Cub-Dha  (Figure  6,  A  and  B)  was  compared  with  the 
activity  of  Nub"Sec62p  and  Nui,-Sec61p  in  the  presence 
of  the  same  Cub'Containing  fusions.  Remarkably,  no 
cleavage  of  Mfa37-Cu[,-Dha  was  observed  in  the  pres¬ 
ence  of  Nub*Sec62(AC60)-Dha,  whereas  the  cytosolic 
Tpil-Cub-Dha  was  cleaved.  This  result  (Figure  6,  A 
and  B)  indicated  that  the  concentration  and  accessibil¬ 
ity  of  Nub  were  comparable  for  the  functionally  inac- 
five  Nub-Sec62(AC60)-Dha  and  the  functionally  active 
Nub-Sec62p.  In  these  experiments,  Nub-Sec62(AC60)- 
Dha,  which  could  be  detected  with  the  anti-ha  anti¬ 
body  (Figure  6,  A  and  B),  was  expressed  from  the 
uninduced  Pcupi  promoter.  Strikingly,  even  overex¬ 
pression  of  Nub-Sec62(AC60)-Dha,  from  the  copper- 
induced  Pcupi/  did  not  result  in  a  significant  cleavage 
of  Mfa37-Cub-Dha  (our  unpublished  results).  These 
control  experiments  with  the  inactive  derivative  of 
Sec62p  indicated  that  the  proximity  signal  in  the 
split-Ub  assay  with  Sec62p  requires  the  functional  ac¬ 
tivity  of  Sec62p. 

Using  Ura3p  Reporter  to  Detect  the  In  Vivo 
Proximity  between  Sec62p  and  Signal  Sequences 

The  DHFR-ha  (Dha)  reporter  moiety  of  Mfa37-Cub- 
Dha  was  replaced  by  S.  cerevisiae  Ura3p  (orotidine-5'- 
phosphate  decarboxylase),  yielding  Mfa37-Cub-Ura3p. 
"^e  use  of  cytosolic  Ura3p  as  a  reporter  for  transloca¬ 
tion  across  membranes  is  well  documented  (Maarse  et 
al,  1992;  Johnsson  and  Varshavsky,  1994b;  Ng  et  al, 
1996).  The  high  sensitivity  of  Ura3p-based  assays  (cells 
become  Ura"*"  if  a  threshold  amount  of  Ura3p  is 
present  in  the  cytosol)  allowed  us  to  express  the  N^^ 
and  Cub  fusions  from  the  uninduced  Pcupi  promoter. 
Since  the  efficient  translocation  of  Mfa37-Cub-Ura3p 
sequesters  the  Ura3p  activity  in  the  ER,  a  uraSA  strain 
of  S.  cerevisiae  that  expressed  Mfa37-Cub-Ura3p  re¬ 
mained  Ura  (Figure  7 A).  Nub-Sec62p,  which,  as 
shown  above,  is  close  to  the  nascent  chain  of  Mfa37- 
Cub-Dha  during  its  translocation,  induced  enough 
cleavage  of  Mfa37-Cub-Ura3p  at  the  Cub-Ura3p  junc¬ 
tion  to  render  cells  Ura+  (Figure  7B).  Cells  were  trans- 
formed  with  either  N^i,-Sec62p,  N„t,-Sec61p, 
Sec62(AC60)“Dha,  or  N^b-Boslp  to  compare  relative 
proximities  of  these  N^i^-containing  proteins  to 
fusions  bearing  the  Ura3p  reporter  moiety  and  either 
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Figure  7.  The  use  of  a  metabolic  marker  to  assess  the  proximity 
between  a  component  of  the  translocation  machinery  and  a  trans¬ 
located  protein.  S.  cerevisiae  expressing  the  fusions  8-12  (Figure 
2)  that  contained  Ura3p  instead  of  Dha  (see  the  main  text)  were 
transformed  with  the  vector  (A)  or  plasmids  expressing  Nub“Sec62p 
(B),  N„b-Sec61p  (C),  N„b-Sec62(AC60)Dha  (D),  and  N^b'Boslp  (E). 
Cells  were  grown  in  a  liquid  uracil-containing  SD  medium,  and 
10^,  and  10^  cells  were  spotted  onto  uracil-lacking  SD  me¬ 
dium.  Plates  were  examined  after  18  h  at  30°C. 


the  invertase-derived  or  the  a-factor-derived  signal 
sequence  (Figure  1,  B-E).  The  cells  were  spotted  on 
plates  lacking  uracil  and  incubated  at  30®C  for  18  h. 
The  growth  patterns  of  strains  that  expressed  different 
comWnations  of  and  Cub-containing  fusions  con¬ 
firmed  the  results  of  analyses  with  analogous  (but 
more  highly  expressed)  Dha-based  constructs. 

In  particular,  the  interaction  of  Sec62p  with  the  sig¬ 
nal  sequence  of  prepro-a-factor  was  stronger  than 
with  the  signal  sequence  of  invertase.  This  proximity 
was  not  detectable  when  the  distance  between  a  signal 
sequence  and  the  C^b  nioiety  of  a  fusion  was  increased 
(Figure  7B).  Sec61p  appears  to  be  equally  close  to  both 
of  the  signal  sequences  tested.  Again,  the  proximity 
signal  was  gradually  lost  when  the  distance  between 
the  signal  sequence  and  the  C^b  moiety  was  increased 
(Figure  7C).  Cells  acquired  a  weak  Ura"^  phenotype  in 
the  presence  of  N^b-Boslp  and  the  C^b-UraSp  fusions 


8  and  10  (Figure  7E).  If  used  as  a  reference  to  discrim¬ 
inate  between  specific  and  nonspecific  signals  in  this 
assay,  Sec62p,  under  these  conditions,  appears  to  in¬ 
teract  only  with  the  a-f actor  signal  sequence.  No  in¬ 
teraction  with  any  of  the  tested  C^b  constructs  was 
detectable  with  the  functionally  inactive  N^b- 
Sec62(AC60)-Dha  (Figure  7D). 

DISCUSSION 

The  new  application  of  the  split-Ub  technique  (Johns- 
son  and  Varshavsky,  1994a,  1997)  described  in  the 
present  work  introduces  a  tool  for  the  analysis  of 
transient  (short-lived)  protein  interactions  in  living 
cells.  A  split-Ub  assay  involves  the  tagging  of  two 
(presumably)  interacting  proteins  with  the  N-  and 
C-terminal  halves  of  Ub,  N^b  and  C^b/  and  monitoring, 
in  a  variety  of  ways,  the  release  of  the  reporter  protein 
fused  to  the  C  terminus  of  C^b*  The  reporter  release, 
through  the  cleavage  by  Ub-specific  processing  pro¬ 
teases  (UBPs),  takes  place  in  the  cytosol  if  the  two 
halves  of  Ub  interact  in  vivo  to  form  a  quasi-native  Ub 
moiety  upstream  of  the  reporter  (Figure  1).  Among  the 
advantages  of  this  method  are  its  applicability  either 
in  living  cells  or  in  vitro  and  its  sensitivity  to  kinetic 
aspects  of  a  protein  interaction. 

In  the  present  work,  we  applied  the  split-Ub  tech¬ 
nique  to  the  problem  of  protein  translocation  across 
membranes.  We  showed  that  Sec62p  of  S.  cerevisiae  is 
spatially  close  to  the  signal  sequence  of  the  nascent 
a-factor  polypeptide  in  vivo.  This  proximity  is  con¬ 
fined  to  the  nascent  polypeptide  chain  immediately 
following  the  signal  sequence.  In  addition,  the  extent 
of  proximity  depends  on  the  nature  of  the  signal  se¬ 
quence.  Specifically,  Cub-containing  test  proteins  that 
bore  the  signal  sequence  of  invertase  resulted  in  a 
much  lower  Ub  reconstitution  with  Nub-Sec62p  than 
the  same  C^b-containing  proteins  bearing  the  signal 
sequence  of  a-factor.  An  inactive  derivative  of  Sec62p 
failed  to  interact  with  signal  sequences  in  the  split-Ub 
assay.  Taken  together,  these  findings  are  the  first  in 
vivo  evidence  that  S.  cerevisiae  Sec62p,  a  component  of 
the  ER  translocation  machinery,  is  a  part  of  a  signal 
sequence-binding  complex. 


In  Vivo  Proximity  between  Sec62p  and  the  Signal 
Sequence  of  a-Factor 

We  have  previously  shown  that  a  region  of  the  nascent 
polypeptide  chain  that  lies  close  to  the  signal  sequence 
of  invertase  or  the  prepro-a-factor  is  briefly  exposed  to 
the  cytosol  before  its  translocation  into  the  ER  (Johns- 
son  and  Varshavsky,  1994b).  This  feature  of  transloca¬ 
tion  enabled  us,  in  the  present  work,  to  use  the’ 
split-Ub  assay  for  monitoring  the  proximity  between  a 
secretory  protein  and  components  of  the  translocation 
machinery.  The  C^b  moiety  was  placed  37  residues 
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downstream  from  the  signal  sequence  of  the  a-factor 
precursor,  and  the  N  terminus  of  Sec62p  (see  INTRO¬ 
DUCTION)  was  extended  with  N^t,-  Using  this  ver¬ 
sion  of  the  split-Ub  assay,  we  could  demonstrate  that 
Sec62p  is  close  to  the  nascent  chain  of  the  a-factor 
during  its  translocation.  By  moving  the  Cuj,  moiety 
farther  downstream  from  the  signal  sequence  of  the 
a-factor  precursor,  we  obtained  "snapshots"  of  the 
relative  proximity  between  the  nascent  polypeptide 
chain  and  Sec62p  in  vivo.  The  proximity  thus  detected 
was  considerably  reduced  once  the  spacer  sequence 
between  the  signal  sequence  of  the  a-factor  precursor 
and  Cub  was  increased  from  37  to  65  residues  (Figures 
5B  and  7B).  The  data  strongly  suggest  that  the  access 
of  Sec62p  to  the  nascent  chain  of  a-factor  is  confined  to 
a  region  of  the  nascent  chain  that  is  very  close  to  the 
signal  sequence.  Similar  results  were  obtained  with 
the  signal  sequence  of  invertase  as  well.  This  property 
of  Sec62p  is  the  expected  feature  of  a  component  of  a 
signal  sequence  receptor. 

Our  interpretation,  supported  by  several  control  ex¬ 
periments,  is  in  agreement  with  the  results  produced 
by  cross-linking  and  binding  studies  in  cell-free  sys¬ 
tems  (Miisch  et  al,  1992;  Lyman  and  Schekman,  1997; 
Matlack  et  al,  1997).  Sec62p  could  be  cross-linked  to 
the  a-factor  precursor  in  vitro,  but  only  when  ATP 
was  omitted  and  the  initiation  of  translocation  of 
a-factor  was  halted.  Upon  the  addition  of  ATP,  the 
translocation  resumed  and  cross-linking  was  no 
longer  possible  (Miisch  et  al,  1992;  Lyman  and  Schek¬ 
man,  1997).  The  cross-linking  between  Sec62p  and  the 
nascent  polypeptide  chain  was  not  observed  when  the 
translocating  chain  was  halted  in  the  ER  channel 
(Miisch  et  al,  1992;  Sanders  et  al,  1992).  It  was  there¬ 
fore  assumed  that  Sec62p  is  not  a  part  of  the  channel 
and  that  it  functions  in  the  early  steps  of  substrate 
recognition  and  initiation  of  translocation.  The 
split-Ub  assay,  in  its  current  form,  depends  on  both 
halves  of  Ub  being  in  the  cell's  cytosol.  Therefore,  the 
absence  of  the  diagnostic  cleavage  (Figures  5B,  5C, 
and  7)  when  the  C^b  moiety  was  placed  farther  down¬ 
stream  from  the  signal  sequence  (see  RESULTS),  while 
consistent  with  the  absence  of  interactions  between 
Sec62p  and  the  nascent  chain  after  the  initiation  of  its 
translocation,  does  not  address  this  issue  directly.  The 
Cub  nioiety  that  emerges  from  the  ribosome  after  it  has 
docked  at  the  ER  channel  is  not  accessible  to  the 
cytosolic  Nub  moiety  even  if  an  Nub-linked  protein  is 
spatially  close  to  the  translocation  pore.  This  also  ex¬ 
plains  the  inability  of  Nub-Sec61p  to  induce  the  cleav¬ 
age  of  Cub-containing  translocation  substrates  bearing 
long  spacer  sequences  between  the  signal  sequence  and 
Cub/  ^though  tire  in  vitro  cross-linking  studies  have 
shown  Sec61p  to  be  in  constant  contact  with  the  trans¬ 
locating  polypeptide  (Mothes  et  al,  1994)  (Figure  7). 

The  proximity  between  Sec62p  (or  Sec61p)  and  a 
translocating  polypeptide  is  short  lived.  The  rapid 


transfer  of  the  Cub  moiety  into  the  lumen  of  the  ER 
was  shown  to  either  prevent  or  strongly  inhibit  its 
interaction  with  the  Nub  moiety  of  Nub-Sec62p  and 
Nub"Scc61p  (Figures  3A  and  6C).  In  these  experiments, 
the  Nub  moieties  bore  either  the  glycine  (Nug)  or  the 
alanine  (Nua)  residue  at  position  13  of  Nub-  These 
modifications  decrease  the  affinity  between  the  two 
halves  of  Ub  (see  INTRODUCTION),  thereby  making 
the  reconstitution  of  a  quasi-native  Ub  moiety  more 
dependent  on  the  stability  (half-life)  of  interactions 
between  the  proteins  linked  to  Nub  and  Cub-  These 
assays  clearly  distinguished  the  Sec62p-Sec61p  signal 
sequence  interactions  from  those  that  underlie  the  bet¬ 
ter  understood,  longer-lived  protein  complexes.  For 
example,  when  linked  to  homodimerizing  leucine  zip¬ 
pers,  the  Nua  moiety,  and  even  the  Nug  moiety,  is 
sufficient  for  reconstitution  of  the  Ub  (Jolmsson  and 
Varshavsky,  1994a). 

Is  Sec62p  Part  of  a  Signal  Sequence  Receptor? 

The  split-Ub  assay  measures  the  concentrations  of  the 
protein-coupled  Nyb  and  Cub  moieties  in  the  immedi¬ 
ate  vicinity  of  each  other.  Therefore,  a  positive  result 
of  a  split-Ub  assay  signifies  a  spatial  proximity  be¬ 
tween  the  two  proteins  but  cannot,  by  itself,  prove 
their  physical  interaction  or  address  the  functional 
significance  of  this  proximity.  Nub-Sec62(AC60)Dha  is 
functionally  inactive  and  was  shown  to  be  not  close  to 
the  translocating  Mfa37-Cub-Dha  (Figures  6  and  7). 
Since  the  sequence  between  Nub  and  the  first  mem¬ 
brane-spanning  region  of  Sec62  was  retained  in  the 
C-terminally  truncated  Sec62(AC60)Dha,  the  distance 
between  Nub  and  the  ER  membrane  was,  most  prob¬ 
ably,  not  altered  relative  to  wild-type  Sec62p.  The  lack 
of  significant  cleavage  of  MfagT-Cub-Dha  in  the  pres¬ 
ence  of  Nub-Sec62(AC60)Dha  must  therefore  result 
from  the  increased  distance  between  Sec62(AC60)p 
and  the  translocating  polypeptide  chain.  The  C-termi- 
nal  domain  of  intact  Sec62p  may  contact  other  com¬ 
ponents  of  the  translocation  complex;  alternatively,  it 
may  contribute  to  a  binding  site  for  the  signal  se¬ 
quence  or  the  nascent  chain.  These  and  related  uncer¬ 
tainties  notwithstanding,  our  results  (Figures  6  and  7) 
provide  the  first  in  vivo  evidence  to  support  the  view 
that  Sec62p  is  part  of  a  signal  sequence-binding  com¬ 
plex. 

Sec62p  Discriminates  between  Different  Signal 
Sequences 

The  split-Ub  assay  has  made  it  possible  to  show  that 
Sec62p  discriminates,  in  living  cells,  between  two  dis¬ 
tinct  signal  sequences  in  otherwise  identical  fusion 
proteins.  In  the  presence  of  Nub-Sec62p,  more  of  the 
free  Dha  reporter  protein  was  produced  in  vivo  from 
Mfa37-Cub"Dha  than  from  Suc223-Cub*Dha  (Figures  5 
and  6).  This  selectivity  is  a  property  of  Sec62p  and  not 
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a  feature  of  the  assay  used,  since  approximately  equal 
amounts  of  the  cleaved  reporter  were  produced  with 
different  signal  sequences  if  the  N^t,  moiety  was 
present  as  the  Nut,“Sec61p  fusion  (Figures  6  and  7). 
This  result  confirmed,  in  vivo,  that  different  targeting 
pathways  converge  at  the  Sec61-containing  complex 
to  initiate  translocation.  The  existence  of  at  least  two 
different  targeting  pathways  to  the  translocation  pore 
was  suggested  by  Walter  and  co-workers  on  the  basis 
of  the  properties  of  yeast  mutants  that  lacked  either 
SRP  or  its  receptor  (Hann  and  Walter,  1991;  Ogg  et  ah, 
1992).  One  possibility  is  that  the  targeting  via  SRP 
operates  cotranslationally,  whereas  the  targeting  via 
the  Sec62/63  complex  is  predominantly  posttransla- 
tional.  The  bifurcation  between  the  cotranslational  and 
posttranslational  targeting  is  expected  to  be  stochastic 
for  many  translocated  proteins.  Nonetheless,  certain 
signal  sequences  do  prefer  SRP,  while  some  of  the 
other  signal  sequences  are  targeted  by  the  Sec62/ 
Sec63  complex  (Ng  et  ai,  1996).  Genetic  studies  have 
shown  that  the  translocation  of  invertase  continues  in 
the  presence  of  mutations  in  either  the  SRP  or  the 
Sec62/63  complex  (Deshaies  and  Schekman,  1989; 
Ogg  et  ah,  1992).  However,  the  kinetics  and  efficiency 
of  invertase  translocation  are  altered  in  the  absence  of 
SRP  (Hann  and  Walter,  1991;  Johnsson  and  Var¬ 
shavsky,  1994b). 

To  explain  the  different  efficiencies  of  cleavage  of  the 
two  signal  sequence-bearing  fusions,  we  propose 
the  following  model.  The  signal  sequence  of  invertase 
is  recognized  primarily  by  SRP  and  then  transferred  to 
the  trimeric  Sec61p  complex,  which  completes  the  pro¬ 
tein's  translocation  across  the  ER  membrane.  Under 
conditions  that  result  in  a  shortage  of  SRP  or  a  com¬ 
petition  among  different  signal  sequences,  the  Sec62p/ 
Sec63p-containing  complex,  being  a  part  of  the  alter¬ 
native  targeting  pathway,  would  recognize  an 
increasing  fraction  of  invertase.  This  would  explain 
why  specific  interactions  between  the  invertase  signal 
sequence-bearing  proteins  and  Sec62p  can  only  be 
observed  for  the  more  highly  expressed  C^^b^Dha  fu¬ 
sions  (Figure  5C).  By  contrast,  the  targeting  of  proteins 
bearing  the  signal  sequence  of  the  a-factor  precursor  is 
mediated,  in  vivo,  predominantly  by  the  Sec62p/ 
Sec63p-containing  complex.  This  would  account  for 
the  observed  close  proximity  of  these  test  proteins  to 
Sec62p  (Figures  5-7).  If  so,  the  split-Ub  technique 
makes  it  possible  to  estimate  the  flux  of  two  different 
secretory  proteins  through  the  targeting  pathways  to 
the  ER  membrane  without  the  necessity  of  deleting  or 
otherwise  inactivating  specific  components  of  the  tar¬ 
geting  complex. 

Further  Applications  of  the  SpIit-Ub  Technique 

The  split-Ub  sensor  should  also  be  applicable  to  other 
settings  that  involve  short-lived  protein  interactions 


that  occur  in  the  cytosol  and  are  freely  accessible  to  the 
Ub-specific  proteases.  The  advantage  of  using  this 
method  for  the  analysis  of  protein  translocation  stems, 
in  part,  from  the  fast  and  irreversible  removal  of  the 
translocated  chain  from  the  location  (cytosol)  where 
the  ^ub/^ub  interaction  is  monitored.  Similar  situa¬ 
tions  are  expected  for  the  translocation  of  proteins  into 
other  organelles  such  as  the  mitochondrion,  the  nu¬ 
cleus,  and  the  peroxisome. 

The  demonstration,  in  the  present  work,  that  Ura3p 
can  serve  as  a  reporter  in  a  split-Ub  assay  (Figure  7) 
opens  the  way  to  genetic  screens  based  on  this  assay. 
For  example,  introducing  a  DNA  library  consisting  of 
random  fusions  into  a  strain  expressing  a 

signal  sequence-bearing  Cub-Ura3p  protein  should  al¬ 
low  the  identification  of  genes  involved  in  targeting  or 
translocation  by  enabling  the  cells  to  form  colonies  on 
media  lacking  uracil.  This  selection  for  protein  ligands 
that  interact  transiently  in  the  \^icinity  of  a  membrane 
complements  the  recent  split-Ub- based  screen  for  li¬ 
gands  that  form  relatively  stable  complexes  (Stagljar  et 
al,  1998). 
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Summary 

We  report  the  discovery  of  a  short-lived  chaperone 
that  is  required  for  the  correct  maturation  of  the  eukar¬ 
yotic  20S  proteasome  and  is  destroyed  at  a  specific 
stage  of  the  assembly  process.  The  S.  cerevisiae  Umpip 
protein  is  a  component  of  proteasome  precursor  com¬ 
plexes  containing  unprocessed  p  subunits  but  is  not 
detected  in  the  mature  20S  proteasome.  Upon  the  as¬ 
sociation  of  two  precursor  complexes,  Umpip  is  en¬ 
cased  and  is  rapidly  degraded  after  the  proteolytic 
sites  in  the  interior  of  the  nascent  proteasome  are 
activated.  Cells  lacking  Umpi  p  exhibit  a  lack  of  coordi¬ 
nation  between  the  processing  of  p  subunits  and  pro¬ 
teasome  assembly,  resulting  in  functionally  impaired 
proteasomes.  We  also  show  that  the  propeptide  of 
the  Pre2p/Doa3p  p  subunit  is  required  for  Umpi  p’s 
function  in  proteasome  maturation. 

Introduction 

Ubiqurtin  (Ub)-dependent  proteolysis  underlies  the  bulk 
of  nonlysosomal  protein  degradation  In  eukaryotic  cells 
(reviewed  by  Hochstrasser,  1 996;  Varshavsky,  1 997).  Natu¬ 
rally  short-lived  as  well  as  damaged  or  otherwise  abnor¬ 
mal  proteins  are  recognized  by  the  Ub  system  and  are 
marked  for  degradation  by  the  attachment  of  multi-Ub 
chains.  Ubiquitylated  proteins  are  degraded  by  the  26S 
proteasome,  an  ~2000  kDa,  multisubunit,  ATP-depen- 
dent  protease  that  consists  of  the  1 9S  complex,  which  is 
required  specifically  for  the  degradation  of  ubiquitylated 
proteins,  and  an  ^^700  kDa  complex,  called  the  20S 
proteasome,  which  is  the  ATP-independent  catalytic 
core  of  the  26S  proteasome  (reviewed  by  Peters,  1994; 
Coux  et  al.,  1996;  Lupas  et  al.  1997). 

The  20S  proteasome  is  universal  among  eukaryotes; 
its  structural  homologs  have  also  been  found  in  archae¬ 
ons  and  eubacteiia  (reviewed  by  Coux  et  al.,  1 996).  High- 
resolution  crystal  structures  have  been  reported  for  the 
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20S  proteasomes  of  the  archaeon  Thermoplasma  ac/do- 
philum  and  the  eukaryote  Saccharomyces  cerevisiae 
(Lowe  et  al.,  1 995;  Groll  et  al.,  1 997).  The  Thermoplasma 
proteasome  contains  two  types  of  subunits,  a  and  p, 
which  form  a  hollow  cylinder  composed  of  four  hepta- 
meric  rings  in  the  configuration  07^73707*  The  yeast  20S 
proteasome  has  a  similar  structure  but  contains  14  dis¬ 
tinct  subunits,  seven  of  the  a  type  and  seven  of  the  p 
type.  The  N-terminal  threonines  of  the  p  subunits  of 
the  Thermoplasma  proteasome  act  as  nucleophiles  in 
catalyzing  the  hydrolysis  of  peptide  bonds  of  polypep¬ 
tide  substrates  (Seemuller  et  al..  1995).  Lys-33  and  Glu- 
17  of  the  p  subunit  also  play  a  role  In  catalyzing  the 
cleavage  of  peptide  bonds.  The  14  identical  p  subunits 
of  the  Thermoplasma  proteasome  form  14  identical  ac¬ 
tive  sites,  which  catalyze  the  cleavage  of  substrates 
after  hydrophobic  amino  acid  residues  (chymotrypsin- 
like  active  sites).  These  sites  are  located  on  the  inner 
surface  of  a  central  chamber  formed  by  the  two  rings 
of  p  subunits  (Lowe  et  al.,  1995;  Seemuller  et  al.,  1995). 
p  subunits  are  synthesized  as  inactive  precursors  con¬ 
taining  a  propeptide  that  is  thought  to  be  cleaved  off 
autocatalytically,  yielding  the  mature  p  subunit  bearing 
N-terminal  Thr.  The  propeptides  of  p  subunits  are  not 
required  for  the  in  vitro  assembly  of  the  Thermoplasma 
proteasome  (Seemuller  et  al.,  1996). 

in  eukaryotic  proteasomes,  only  3  of  the  7  distinct  p 
subunits  contain  the  three  conserved  residues  required 
for  activity  of  the  Thermoplasma  proteasome.  Genetic 
and  structural  data  suggest  that  these  three  subunits 
provide  the  active-site  nucleophiles  for  the  three  distinct 
catalytic  activities  of  eukaryotic  proteasomes,  namely 
the  “chymotrypsin-like”  (see  above),  the  “trypsin-like” 
(cleavage  after  basic  residues),  and  the  “peptidylgluta- 
myl  peptide  hydrolyzing  (PGPH)”  activity  (cleavage  after 
acidic  residues).  In  S.  cerevisiae,  these  three  p  subunits 
are  Pre2p/Doa3p,  Pupl  p,  and  Pre3p  (Heinemeyer  et  al., 
1993;  Chen  and  Hochstrasser,  1996;  Arendt  and  Hoch¬ 
strasser,  1997;  Groll  et  al.,  1997;  Heinemeyer  et  al., 
1997).  In  the  human  20S  proteasome,  the  related  pro¬ 
teins  are  MB1,  6,  and  Z;  during  the  immune  response, 
these  subunits  can  be  replaced  by  their  respective  ho¬ 
mologs  LMP7.  LMP2,  and  MECL-1/LMP10  (reviewed  by 
Coux  et  al.,  1996).  In  the  case  of  LMP2  and  LMP7,  it 
has  been  demonstrated  that  these  replacements  are 
functionally  relevant  in  altering  the  specificity  of  antigen 
presentation  by  the  MHC  class  I  pathway.  Tlie  catalytl- 
cally  active  p  subunits  of  the  eukaryotic  proteasome  are 
synthesized  with  propeptides,  similarly  to  the  p  subunit 
in  Thermoplasma  (Schmidtke  et  al.,  1997). 

In  the  crystal  structure  of  the  S.  cerevisiae  20S  protea¬ 
some,  the  opening  to  the  proteasome’s  interior,  formed 
by  the  outer  ring  of  the  a  subunits,  is  not  large  enough 
to  admit  even  an  unfolded  polypeptide  chain,  let  alone 
a  folded  protein  (Groll  et  al.,  1997).  The  degradation  of 
larger  substrates  requires  the  19S  complexes,  which 
attach  at  both  sides  of  the  20S  proteasome,  yielding  the 
26S  proteasome.  The  19S  complexes  contain  subunits 
that  bind  multi-Ub  chains,  at  least  one  Ub-specific  iso¬ 
peptidase  that  disassembles  these  chains,  and  several 
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ATPases  that  are  thought  to  be  Involved  in  perturbing 
the  substrate  conformationally  and  guiding  it  to  the  inte¬ 
rior  of  the  20S  proteasome  (Deveraux  et  al.,  1 994;  Hoch- 
strasser  et  al.,  1 995;  Jentsch  and  Schlenker,  1995;  Coux 
eta!.,  1996). 

Studies  with  mammalian  cells  have  shown  that  the 
20S  proteasome  is  assembled  through  a  15-16S  inter¬ 
mediate,  apparently  a  half-proteasome.  This  intermedi¬ 
ate  contains  all  14  a  and  p  subunits,  some  of  which  are 
in  the  precursor  (propeptide-bearing)  form,  and  several 
uncharacterized  polypeptides  as  well  (Frentzel  et  al.. 
1994;  Yang  et  al..  1995;  Nandi  et  al.,  1997;  Schmidtke 
et  al.,  1997).  Studies  on  the  yeast  20S  proteasome  have 
demonstrated  that  processing  of  proPre2p  (Identical  to 
proDoaSp)  Is  coupled  to  formation  of  the  20S  protea¬ 
some  from  two  half-proteasome  precursors  (Chen  and 
Hochstrasser,  1996).  Formation  of  the  active  site  capa¬ 
ble  of  autocatalytic  processing  of  proPre2p  depends  on 
the  juxtaposition  of  proPre2p  and  Prel  p  on  the  opposite 
sides  of  the  two  halves  of  the  proteasome.  This  mecha¬ 
nism  is  thought  to  prevent  activation  of  proteolytic  sites 
before  the  central  hydrolytic  chamber  has  been  sealed 
off  from  the  cytosol  through  association  of  the  two 
halves  of  the  proteasome  (Chen  and  Hochstrasser, 
1996).  The  Pre2p  propeptide  is  essential  for  the  forma¬ 
tion  of  functional  proteasomes.  Moreover,  this  propep¬ 
tide  can  operate  in  trans^  suggesting  that  it  serves  a 
chaperone-like  function  In  proteasome  maturation  (Chen 
and  Hochstrasser,  1996). 

In  the  present  work,  we  identify  Umpip,  a  novel  pro¬ 
tein,  as  a  component  of  a  precursor  complex  of  the  20S 
proteasome.  This  precursor  contains  unprocessed  p 
subunits.  Upon  formation  of  the  20S  proteasome  from 
two  such  precursors,  the  propeptides  of  p  subunits  are 
removed,  a  process  that  is  accompanied  by  removal  of 
Umpi  p  from  the  proteasome  through  Umpi  p  degrada¬ 
tion,  which  requires  the  proteasome’s  proteolytic  activ¬ 
ity.  In  umpi  A  cells,  which  lack  Umpip,  coordination  of 
20S  complex  formation  and  processing  of  p  subunits  is 
impaired,  resulting  in  incompletely  or  (in  the  case  of 
Pre2p)  prematurely  processed  p  subunits.  These  find¬ 
ings  reveal  Umpi  p  as  a  novel  type  of  molecular  chaper¬ 
one,  a  short-lived  maturation  factor  required  for  the 
efficient  biogenesis  of  the  20S  proteasome.  We  also 
describe  genetic  evidence  that  the  propeptide  of  pro- 
Pre2p  is  required  for  Umpi  p-dependent  proteasome 
maturation,  and  we  present  a  model  that  accounts  for 
some  of  the  functions  of  Umpip  and  propeptides  in 
proteasome  maturation. 

Results 

The  UMPI  Gene  Js  Required  for 
Ubiquitin-Mediated  Proteolysis 
To  identify  the  genes  required  for  Ub-dependent  prote¬ 
olysis  in  S.  cerevisiae,  we  screened  for  mutants  defec¬ 
tive  in  the  degradation  of  test  substrates.  One  such 
mutant,  termed  ump1-1  (ub-mediated  proteolysis),  ex¬ 
hibited  defects  in  the  degradation  of  several  normally 
short-lived  proteins  (see  below).  The  complementing 
UMPI  gene  encoded  a  148-resldue  (16.8  kDa)  protein. 
Searches  of  the  databases  did  not  identify  close  homo¬ 
logs  of  Umpip  but  did  detect  similarities  to  regions  of 


several  proteins.  One  intriguing  similarity  was  between 
Umpi  p  and  C-terminal  regions  of  the  protease  inhibitor 
contrapsin  (25%  identity,  52%  similarity)  and  related 
proteins  (Figure  1A),  consistent  with  the  likely  role  of 
Umpip  as  an  inhibitor  of  premature  proteolytic  acti¬ 
vation  of  the  proteasome  precursor  complexes  (see 
below). 

Pulse-chase  analyses  in  umpi  A  mutants  revealed  a 
strong  stabilization  of  several  normally  short-lived  test 
proteins,  in  particular  an  N-end  rule  substrate  Arg-p- 
galactosldase  (Arg-p-gal),  which  bears  N-terminal  Arg. 
a  destabilizing  residue  (Varshavsky,  1996),  Ub-Pro-p- 
gal  (a  substrate  of  the  UFD  pathway;  Johnson  et  al., 
1 995;  Varshavsky,  1 997),  and  a2«jeQi-p-9a*  substrate  of 
the  DOA  pathway;  Hochstrasser  and  Varshavsky,  1990; 
Hochstrasser  et  al..  1995)  (Figure  IB).  The  recognition 
and  ubiquitylation  of  these  substrates  involve  different 
recognins  (E3  proteins)  and  different  Ub-conjugating 
(E2)  enzymes.  The  pleiotropic  character  of  the  umpi  A 
phenotype  suggested  that  Umpip  functions  downstream 
of  the  recognition  and  ubiquitylation  components  of  the 
Ub  system,  perhaps  at  the  step  of  proteolysis,  or  in 
regulating  the  supply  of  Ub. 

umpi  A  Mutants  Are  Hypersensitive  to  a  Variety 
of  Stresses,  Accumulate  Ub-Protein 
Conjugates,  and  Do  Not  Sporulate 
umpi  A  mutants  grew  more  slowly  than  congenic  wild- 
type  (wt)  cells  at  30°C  or  lower  temperatures  and  were 
severely  growth-impaired  at  higher  temperatures  (37®C) 
(Figure  1C  and  data  not  shown).  In  addition,  they  were 
hypersensitive  to  cadmium  ions  and  to  the  arginine  ana¬ 
log  canavanine  (Figure  1C).  Similar  phenotypes  have 
been  reported  for  mutants  in  genes  encoding  E2  (Ubc) 
enzymes  or  proteasome  subunits  (Heinemeyer  et  al., 
1993;  Jungmann  et  al.,  1993).  Comparisons  of  proteins 
from  whole-cell  extracts  of  wt  and  umpi  A  cells  by  immu- 
noblotting  with  anti-Ub  antibodies  showed  a  dramatic 
accumulation  of  Ub-protein  conjugates  in  the  umpi  A 
mutant.  At  the  same  time,  the  level  of  free  Ub  was  re¬ 
duced  in  umpi  A  cells  (data  not  shown).  Thus,  the  pri¬ 
mary  cause  of  the  umpi  A  phenotype  appears  to  be  the 
impaired  ability  of  umpi  A  cells  to  degrade  Ub-protein 
conjugates.  Homozygous  umpi  A  diploids  (strain  JD61) 
were  unable  to  sporulate  (data  not  shown). 

Umpip  Is  a  Component  of  Proteasome  Precursors 
To  investigate  Umpi  p  biochemically,  an  epitope  tag  was 
linked  to  its  C  terminus.  A  single  copy  of  the  modified 
UMPI  gene  [UMPUha],  expressed  from  its  natural  pro¬ 
moter  and  chromosomal  location,  restored  wt  growth 
rates.  We  analyzed  whole-cell  extracts  of  a  strain  ex¬ 
pressing  Umpip-ha  and  Prelp-ha  (the  latter  an  ha- 
tagged  p  subunit  of  the  proteasome)  by  gel  fittration  on 
Superose-6.  Umpip-ha  eluted  in  fractions  22  and  23, 
both  of  which  also  contained  Prel  p-ha  and  other  protea- 
somal  subunits  (Figures  2A,  2B,  zmd  3A)  but  lacked  the 
chymotrypsin-like  activity  of  the  mature  20S  protea¬ 
some.  The  apparent  size  of  the  Umpi  p-containing  com¬ 
plex  was  300-400  kDa  (Figure  2A).  20S  and  26S  protea¬ 
somes,  as  well  as  free  19S  caps,  and  20S  proteasomes 
with  one  attached  19S  cap,  eluted  in  earlier  fractions 
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Figure  1.  Umpip  Is  Required  for  Ubiquitln- 
Mediated  Proteolysis 

(A)  Sequence  similarities  between  Umpip 
and  C'termlnal  parts  of  mouse  contrapsin. 
human  alpha!  antitrypsin  inhibitor  (aiact)  and 
mouse  spi-2  (PIR  accession  numbers,  re¬ 
spectively,  JX0129,  A90475,  and  S31305). 
The  sequences  were  aligned  using  the  PileUp 
program  (GCG  package,  version  7.2,  Genet¬ 
ics  Computer  Group,  Madison,  Wl).  Gaps 
(indicated  by  hyphens)  were  used  to  max¬ 
imize  alignments.  Residues  identical  between 
Umpi  p  and  at  least  one  of  the  other  proteins 
are  shaded  in  gray.  Residues  identical  among 
at  least  two  proteins  other  than  Umpip  are 
boxed. 

(B)  Pulse-chase  analysis  comparing  the  met¬ 
abolic  stabilities  of  R-p-gal,  Ub-P-*3-gal,  and 
MATa2,-68-P-gal  in  wt  {UMPI)  and  umpll 
cells.  The  open-ended  brackets  denote  the 
position  of  multiubiquitylated  p-gal  species. 
Asterisks  denote  an  kDa  p-gal  cleavage 
product  characteristic  of  short-lived  p-gal  de¬ 
rivatives  (Dohmen  et  a).,  1991). 

(C)  Growth  of  umpll  (JD59),  UMPI  (JD47- 
13C),  and  umpl^  cells  transformed  with 
YCp50-UMP7  (expressing  wild-type  UMPI). 
Cells  were  streaked  on  YPD  plates  and  incu¬ 
bated  for  2  days  at  SO'C  (or  at  37X  where 
indicated),  with  0.8  jig/ml  canavanine  or  30 
p.m  CdClj  where  indicated. 


(15-21),  as  determined  by  nondenaturing  gel  electro¬ 
phoresis  (Figure  2B  and  data  not  shown),  by  assays  of 
the  proteasome’s  proteolytic  activity,  and  by  immuno¬ 
logical  detection  of  Ctm3p,  a  subunit  specific  for  the 
19S  cap  of  the  26S  proteasome  (Ghisiain  et  al.,  1993). 

Cochromatography  of  Umpi  -ha  and  Prel  p-ha  in  frac¬ 
tions  22  and  23  (Figure  2A)  suggested  that  Umpip  is  a 
component  of  proteasome  precursor  complexes.  Con¬ 
sistent  with  this  possibility,  the  Umpi  p-containing  com¬ 
plex  had  a  higher  mobility  than  the  20S  proteasome 
upon  nondenaturing  gel  electrophoresis.  A  proteasome 
precursor  with  an  electrophoretic  mobility  Indistinguish¬ 
able  from  that  of  the  Ump1p-containlng  complex  was 
also  detected  in  extracts  from  a  strain  that  expressed 
Pupip-ha,  Mother  p  subunit  of  the  20S  proteasome 
(Figure  2B).  ' 

To  produce  independent  evidence  bearing  on  the  na¬ 
ture  of  the  Umpi  p-containing  complex,  we  constructed 
a  strain  that  expressed  both  Umpi  p-ha  and  doubly 
tagged  Prelp-Flag-HiSe  (Prelp-FH).  Affinity  chromatog¬ 
raphy  on  NI-NTA-agarose  and  antl-Flag  antibody  resin 
was  then  used  to  purify  complexes  containing  Prelp- 
FH.  Umpi  p-ha  cofractionated  with  Prelp-FH  in  both 
affinity  purification  steps  (Figures  2C  and  2D),  confirm¬ 
ing  that  Umpi  p  is  a  component  of  a  distinct  proteasome- 
related  complex.  This  complex  sedimented  at  M5S  in 
sucrose  gradients  (data  not  shown).  Taken  together, 


this  evidence  strongly  suggested  that  the  Umpi  p-con¬ 
taining  complex  Is  a  precursor  of  the  20S  proteasome 
that  is  similar  to  the  15-16S  precursors  observed  In  the 
maturation  pathway  of  mammalian  proteasomes  (Frent- 
zel  et  al.,  1994;  Yang  et  al.,  1995). 


The  Umpip  Proteasome  Precursor  Complex 
Contains  Unprocessed  p  Subunits 
Several  subunits  of  the  proteasome  are  synthesized  as 
precursors  containing  N-terminal  extensions  (propep¬ 
tides)  that  are  absent  from  the  mature  proteasome. 
These  precursors  are  detected  in  the  mammalian  15-16S 
complex  (Frentzel  et  al.,  1994;  Yang  et  at.,  1995).  To 
compare  the  Umpi  p-containing  complex  with  the  mam¬ 
malian  1 5-1 6S  complex,  we  analyzed  the  former  for  the 
presence  of  propeptide-containing  proteasomal  sub¬ 
units.  These  p  subunits  were  modified  by  the  addition  of 
C-terminal  ha  tags.  Strains  expressing  Pupl  p-ha,  Pre2p- 
ha,  and  Pre3p-ha  grew  at  wt  rates  (data  not  shown).  When 
extracts  of  these  strains  were  fractionated  by  gel  filtra¬ 
tion  on  Superose-6,  the  fractions  containing  the  Umpi  p 
complex  contained  largely  the  p  subunit  precursors, 
proPupl  -ha,  proPre2p-ha,  and  proPre3p-ha  (Figure  3A). 
The  corresponding  mature  p  subunits  largely  eluted  in 
the  earlier  fractions  that  contained  20S  and  26S  protea¬ 
somes  (Figure  3A).  Subtle  but  reproducible  differences 
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Figure  2.  Umpip  Is  a  Component  of  Proteasome  Precursor  Com¬ 
plexes 

(A)  An  extract  from  strain  JD127  expressing  Ump1p-ha  and  Pre1  p- 
ha  instead  of  wt  versions  of  these  proteins  was  fractionated  by 
gel  filtration  on  Superose-6.  The  upper  panel  shows  the  results  of 
measurements  of  chymotrypsin-like  activity  in  the  relevant  fractions 
(12-29).  The  last  fraction,  representing  the  void  volume  (Vo).  and 
positions  of  the  peaks  of  marker  proteins  are  indicated  by  dashed 
lines.  The  same  fractions  were  analyzed  by  SDS-PAGE  and  immu- 
noblotting  with  anti-Cim3p  and  antl-ha  antibodies,  which  detected 
Cim3p,  Prelp-ha,  and  Umpip-ha,  as  indicated. 

(B)  Analysis  of  Superose-fractionated  20S  proteasomes  and  precur¬ 
sor  complexes  by  nondenaturing  gel  electrophoresis  and  immu- 
noblotting  vwth  antl-ha  antibody.  The  strains  were  JD1 39  and  JD1 29 
expressing,  respectively.  Pupip-ha  and  Umplp-ha.  Asterisk,  20S 
proteasome  with  one  attached  19S  regulator  cap.  The  26S  protea¬ 
some  eluted  in  fractions  14-17  of  Superose-6  (not  shown). 

(C)  Affinity  purification  of  Pielp-FH  onJy|i-(NTA)-Sepharoso  and  co¬ 
purification  of  Umplp-ha.  Purification  of  proteins  from  an  extract 
of  strain  JD126  expressing  Umplp-ha  and  Prelp-FH  was  carried 
out  as  described  in  the  Experimental  Procedures.  Different  fractions 
were  analyzed  by  SDS-PAGE  and  immunoblotting  with  anti-Flag 
(upper  pariel)  and  anti-ha  antibodies  (lower  panel).  Ce.  cmde  extract; 
ft.  flow-through.  In  contrast,  no  binding  of  Umpip  to  the  resin  was 
observed  when  extracts  from  strain  JD129  expressing  untagged 
Prel  p  were  used  (not  shown). 

P)  Affinity  purification  of  Prelp-FH  on  anti-Flag  antibody  agarose 
resin.  Material  that  was  eluted  from  the  Ni-(NTA)-Sepharose  column 
shown  in  panel  C  was  sut^ected  to  a  second  affinity  purification 
on  anti-Flag  antibody  resin.  Prelp-FH  was  specifically  eluted  with 
increasing  concentrations  of  the  Flag  peptide.  W.  wash. 


were  detected  among  the  tagged  p  subunits  with  re¬ 
spect  to  the  distribution  of  their  precursor  and  pro¬ 
cessed  forms  in  various  fractions.  For  example,  pro- 
Pup1  p-ha  was  detectable  only  in  fractions  22  and  23. 
and  was  absent  from  fraction  21 .  By  contrast,  proPre2p- 
ha  was  detectable  in  the  larger  complexes  (down  to 
fraction  19),  while  fractions  22  and  23  already  contained 
some  processed  Pre3p-ha.  These  patterns  suggested 
a  defined  order  of  processing  events.  One  conclusion 
from  these  experiments  Is  that  the  Umpip-containing 
complex  is  a  precursor  of  the  20S  proteasome  that  con¬ 
tains  unprocessed  p  subunits  and  is  therefore  proteolyti- 
cally  inactive  (Figures  3A  and  3C). 

Umpip  Is  Required  for  Correct 
Proteasome  Maturation 

Next,  we  asked  what  effect  the  umpl^  mutation  has  on 
maturation  and  activity  of  the  proteasome.  Specifically, 
the  analyses  described  above  were  repeated  with  ex¬ 
tracts  obtained  from  umpll  cells.  Figure  3C  shows  that 
there  was  a  significant  reduction  in  the  three  proteolytic 
activities  of  the  proteasome  in  the  fractions  containing 
the  20S  and  26S  proteasomes  (fractions  14-21).  The 
reduction  of  the  specific  activity  appears  to  be  partially 
compensated  by  an  increased  expression  of  protea¬ 
somes  (Figures  3A  and  3B,  data  not  shown)  similar  to 
that  observed  in  cells  expressing  mutant  p  subunits 
(Arendt  and  Hochstrasser,  1 997).  In  addition,  a  new  (ab¬ 
sent  from  wt  cells)  peak  of  chymotrypsin-like  activity 
encompassing  fractions  22  and  23  was  detected  in  ex¬ 
tracts  from  umpfA  cells.  These  were  the  fractions  that 
contained  the  Umpip  complex  from  wt  extracts  (see 
above  and  Figure  3A),  suggesting  that  the  absence  of 
Umpip  from  the  proteasome  precursor  complex  was 
the  cause  for  premature  activation  of  its  chymotryptic 
activity  in  the  umpf  A  mutant.  To  verify  that  the  detected 
activity  was  actually  of  the  '^ISS  precursor  complex 
from  umpt  A  cells,  the  fraction  22  samples  from  strains 
expressing  either  wt  Pre2p  or  Pre2p-ha  were  incubated 
with  anti-ha  antibody  and  protein  A  Sepharose.  This 
treatment  did  not  deplete  the  chymotryptic  activity  from 
the  sample  containing  untagged  Pre2p  but  did  deplete 
~80%  of  the  activity  from  an  otherwise  identical  sample 
derived  from  umpi  A  cells  expressing  Pre2p-ha  (data  not 
shown).  Thus,  the  chymotryptic  activity  of  this  fraction 
resided  in  the  proteasome  precursor  complexes. 

To  follow  the  appearance  of  the  precursor  and  mature 
forms  of  p  subunits  in  proteasome  precursors  and  ma¬ 
ture  proteasomes,  we  analyzed  extracts  from  umplA 
cells  expressing  ha-tagged  versions  of  these  subunits 
by  immunoblotting  with  anti-ha  antibody.  The  process¬ 
ing  of  the  three  analyzed  p  subunits,  Pupip-ha,  Pre2p- 
ha,  and  Pre3p-ha,  was  strikingly  different  in  umpi A  and 
wt  cells  (Figure  3A).  For  all  three  subunits,  a  dramatic 
increase  of  their  precursors  was  detected  In  the  frac¬ 
tions  (1 4-21 )  that  contained  the  20S  and  26S  forms  of  the 
proteasome.  In  addition,  different  processed  variants, 
possibly  representing  processing  intermediates  of  pro- 
Pre2p,  could  be  detected  in  fractions  22  and  23,  which 
contained  the  M  5S  proteasome  precursor,  and  in  some 
of  the  proteasome-contalning  fractions  as  well  (Figure 
3A).  In  contrast,  with  the  exception  of  Pre3p,  almost 
no  processed  p  subunits  were  present  in  the  fractions 
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Figure  3.  Umplp  Is  Required  for  the  Correct 
Processing  of  Proteasomal  [i  Subunits 

(A)  SDS-PAGE  and  anti-ha  immunoblot  analy¬ 
ses  of  extracts  fractionated  on  Super(«e-6. 
The  congenic  yeast  strains  used  were  JDl  , 
JD132.  JD133,  JD134.  JD135.  and  JD136 
fTable  1).  Western  blots  of  the  umpfA  ex¬ 
tracts  were  developed  for  a  shorter  time  to 
compensate  for  an  increased  expression  of 
proteasomal  subunits  in  the  mutant.  Aster¬ 
isks.  processing  intermediates  or  incom¬ 
pletely  processed  forms  of  Pre2p-ha  that  were 
observed  in  the  umpl^  mutant  but  not  in  wt 
cells. 

(B)  Pulse-chase  analysis  companng  the  rate 
of  p  subunit  processing  in  wt  cells  to  that  in 
ump1^  cells  (same  strains  as  in  A).  Cells  were 
pulse-labeled  with  “S-Met/Cys  for  5  min.  Pro¬ 
teins  were  precipitated  with  anti-ha  antibody. 
Ruorographic  exposures  were  '-S  times 
longer  for  the  UMP1  samples.  P.  propeptide- 
containing  precursor  form;  m,  mature  form; 
asterisks,  as  in  (A);  dots,  nonspecific  bands 
present  in  some  samples. 

(C)  Comparison  of  proteolytic  activities  in  ex¬ 
tracts  with  equal  amounts  of  total  protein  of 
UMP1  (JDl  27)  and  wmptA  cells  (JD75)  frac¬ 
tionated  by  gel  filtration  on  Superose-6.  The 
profile  of  the  chymotrypsin-like  activity  in 
UMP1  is  the  same  as  in  Figure  2A. 
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containing  the  ~15S  proteasome  precursor  from 
cells,  and  no  processing  intermediates  of  Pre2p  could 
be  detected  either  (Figure  3A).  These  findings  indicate 
that  Ump1  p  is  required  to  prevent  premature  processing 
of  at  least  proPre2p  and  that  the  presence  of  Umpi  p  is 
Important  for  the  coordination  of  proteasome  assembly 
and  subunit  processing.  ^ 

A  defect  of  umpf  A  cells  in  the  processing  of  p  subunits 
was  also  observed  by  following  the  metabolic  fate  of 
these  subunits  in  pulse-chase  experiments.  In  vrt  cells, 
most  of  the  propeptide-containing  forms  of  p  subunits 
(proPupl-ha,  proPre2p-ha  and  proPre3p-ha)  were  con¬ 
verted  into  their  mature  counterparts  during  the  30-min 
chase  (Figure  3B).  By  contrast,  in  the  umpf  A  mutant,  the 
bulk  of  the  p  subunit  precursors  remained  unprocessed 
during  this  time,  a  finding  consistent  with  the  results  of 
immunoblot  analyses  (Figure  3A).  In  the  case  of  Pre2p- 
ha,  pulse-chase  analysis  again  revealed  the  species  of 
intennediate  size  (putative  processing  intermediates)  in 
extracts  from  umpf  A  cells.  These  species  were  ab^nt 
from  the  equivalent  samples  derived  from  wrt  cells  (Fig¬ 
ure  3B). 


Umplp  Is  Degraded  upon  Formation 
of  the  20S  Proteasome 

The  experiments  above  demonstrated  that  Umplp  is  a 
component  of  proteasome  precursors  that  Is  absent 
from  the  mature  20S  proteasome.  One  possibility  was 


that  Umplp  leaves  the  precursor  complex  upon  forma¬ 
tion  of  the  20S  proteasome  from  the  two  ~15S  precur¬ 
sors  and  may  then  assist  with  another  round  of  protea¬ 
some  assembly,  thus  acting  catalytically.  It  was  also 
possible  that  formation  of  the  20S  proteasome  stencally 
traps  Umplp.  Since  the  formation  of  the  20S  form  of 
the  proteasome  coincides  with  the  appearance  of  i  s 
proteolytic  activities  (Frentzel  et  al.,  1994;  Chen  an 
Hochstrasser.  1996),  Umplp  might  be  degraded  by  the 
newly  formed  20S  proteasome.  Pulse-chase  analyas  of 
Umpi  p-ha  in  wt  cells  showed  that  Umpi  p-ha  is  indeed 
degraded  in  vivo,  the  rate  of  its  degradation  being  similar 
to  the  rate  of  disappearance  of  proPupIp-ha,  which  is 
converted  to  Pupip-ha  upon  maturation  of  the  protea¬ 
some  (compare  Rgures  4A  and  3B). 


Umplp  Is  Stabilized  in  Proteasome  Mutants  and 
Can  Be  Detected  inside  the  20S  Particle 
If  Umplp  becomes  a  substrate  of  the  newly  formOT 
20S  proteasome,  the  degradation  of  Umpi  p  should  be 
inhibited  by  mutations  that  affect  the  proteasome's  pro¬ 
teolytic  activities.  We  used  pulse-chase  assays  to  follow 
the  metabolic  fate  of  Umpi  p-ha  in  the  pre1-1  mutant, 
which  is  known  to  be  deficient  in  the  chymotryptic  activ¬ 
ity  of  the  proteasome  (Heinemeyer  et  al.,  1993).  Vrepip 
was  partially  stabilized  in  preT-1  cells,  as  indicated  by 
the  increased  amount  of  *S-labeled  Umplp  in  pief-T 
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extracts  versus  wt  extracts  that  contained  the  same  total 
amount  of  TCA-precipitable  (Figure  4A).  However,  we 
still  observed  a  significant  decrease  of  pulse-labeled 
Ump1  p  upon  increasing  chase  times  when  the  immuno- 
precipitation  was  earned  out  under  nondenaturing  con¬ 
ditions  (Figure  4A).  By  contrast,  when  the  extracts  were 
treated  with  0.4%  SDS  at  100°C  before  immunoprecipi- 
tation,  virtually  no  decrease  of  the  Ump1  p  signal  during 
the  chase  was  observed  in  pref-t  cells.  These  results 
were  consistent  with  the  possibility  that  the  newly 
formed  Ump1  p-ha  became  inaccessible  to  the  anti-ha 
antibody  during  the  chase  because  it  became  trapped 
within  the  newly  formed  p/et-t  proteasome.  If  so, 
Ump1p  was  expected  to  be  present  in  fractions  from 
the  Superose-6  column  that  corresponded  to  the  20S 
and  26S  proteasomes  in  the  pre1~1  mutant.  Indeed, 
whereas  in  wt  {PRE1)  cells  Ump1  p-ha  was  detected  by 
SDS-PAGE  and  immunoblotting  only  in  fractions  22  and 
23  (corresponding  to  the  MSS  proteasome  precursor 
complex),  in  the  mutant  (pre7-T)  cells  Ump1p-ha  was 
detected  in  fractions  15-23,  indicating  that  It  was  also 
present  in  mature  proteasomes  (Figure  4B).  We  assayed 
the  accessibility  of  Ump1  p-ha  to  anti-ha  antibody  by 
Immunoprecipitation  from  fraction  19  (the  20S  protea¬ 
some)  of  extracts  from  thepre7-t  mutant  and  from  frac¬ 
tion  22  (the  M  5S  proteasome  precursor)  of  extract  from 
PRE1  cells.  Umpip-ha  could  be  immunoprecipitated 
from  either  fraction  1 9  or  22  following  pretreatment  with 
0.1  %  SDS,  but  not  under  nondenaturing  conditions  (Fig¬ 
ure  4C),  indicating  that  the  ha  tag  was  inaccessible  to 
the  anti-ha  antibody  In  both  thepre7-7  proteasome  and 
the  precursor  complex.  However,  when  otherwise  iden¬ 
tical  assays  were  carried  out  with  a  polyclonal  anti- 
Umpl  p  antiserum,  most  of  Umpi  p  could  be  immunopre¬ 
cipitated  under  nondenaturing  conditions  from  fraction 
22  (the  ^15S  proteasome  precursor)  but  was  still  not 
immunoprecipitated  from  'Action  19  (the  20S  pre7-7 
proteasome).  In  contrast,  when  polyclonal  anti-protea- 
some  antibodies  were  used,  Umpip  was  precipitated 
quantitatively  along  with  both  complexes  (Figure  4C). 

These  results  Indicated  that  a  part  of  Umpip  other 
than  its  C  terminus  is  accessible  to  anti-UmpI  p  antibod¬ 
ies  in  the  MSS  precursor.  However,  Umpip  becomes 
entirely  inaccessible  (under  nondenaturing  conditions) 
upon  formation  of  the  20S  proteasome  from  two  MSS 
precursors.  This  interpretation  was  supported  by  exam¬ 
ining  the  sensitivity  of  Umpip-ha  in  different  complexes 
to  trypsin  digestion  (Figure  40).  Specifically,  Umpi  p-ha 
in  the  20S  proteasomes  (fraction  19)  from  pre7-7  cells 
was  completely  protected  against  trypsin.  In  contrast, 
Umpi  p-ha  in  the  M  5S  proteeisome  precursors  was  de- 
tectably  accessible  to  trypsin.  The  degradation  of  Umpi  p 
was  incomplete,  resulting  in  a  protected  fragment  of 
Umpi  p-ha  that  lacked  the  M  kDa  N-termlnal  region  but 
retained  the  C-terminal  ha  tag  (Figure  4D).  Interestingly, 
in  a  similar  experiment  with  a  strain  expressing  pro- 
Pre2p-ha,  we  observed  that,  In  the  Umpi -containing 
MSS  proteasome  precursor  complex  (fraction  22),  pro- 
Pre2p-ha  was  shortened  by  trypsin  treatment  to  yield  a 
product  whose  electrophoretic  mobility  was  indistin¬ 
guishable  from  that  of  the  natural  mature  Pre2p-ha  (Fig¬ 
ure  4D).  In  these  experiments,  the  overall  structure  of 
the  Umpip  proteasome  precursor  complex  remained 
intact  as  judged  by  native  gel  analysis  of  trypsin-treated 
material  (data  not  shown). 
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Figure  4.  Umpip  Is  Stabilized  In pre1*1  Mutants  Cells  that  Are  De¬ 
fective  in  the  Proteasome’s  Chymotrypsin-like  Activity  and  Persists 
in  20S  Proteasomes 

(A)  Pulse-chase  analysis  of  Umpip-ha  In  wt  and  pref-f  cells.  The 
strains  used  were  JD150  (P/7E7)  and  JD151  (pre7-7)  (Table  1),  both 
expressing  Umpi  p-heu  The  cells  were  pulse-labeled  with  ^S-Met/ 
Cys  for  5  min.  Extracts  were  prepared  from  samples  taken  at  differ¬ 
ent  chase  times.  Samples  were  then  split  in  halves.  One-half  (-SDS) 
was  subjected  to  Immunoprecipitation  with  antl-ha  antibody  ac¬ 
cording  to  the  standard  pulse-chase  protocol.  The  other  half  (+SDS) 
was  adjusted  to  0.4%  SDS  and  incubated  at  lOO^'C  for  5  min,  then 
diluted  with  extraction  buffer  to  the  final  concentration  of  0.1  %  SDS 
prior  to  immunoprecipitation. 

(B-D)  SDS-PAGE  and  anti-ha  immunoblot  analyses.  (8)  Detection 
of  Umpip-ha-containing  complexes  in  extracts  fractionated  by  Su- 
perose-6  gel  filtration.  Strains  were  the  same  as  in  (A).  Note  the 
accumulation  of  Umpip-ha  in  fractions  containing  the  20S  and 
26S  proteasome  (fractions  15-21)  in  pfe7-7.  (C)  immunoblot  analy¬ 
ses  of  immunoprecipitations  were  carried  out  with  material  from 
Superose-6  fraction  19  of  extracts  from  strain  JD151  and  fraction 
22  of  extracts  from  strain  JD150,  using  the  indicated  antibodies. 
Immunoprecipitations  were  performed  without  addition  of  SDS 
(-SDS)  or  after  boiling  in  the  presence  of  0.1%  SDS  (+SDS).  P, 
precipitates;  S,  supernatants.  P)  Assaying  trypsin  sensitivity  of  the 
same  material  as  in  (C),  and  of  the  Superose-6  fraction  22,  from  strain 
JD138  expressing  Pre2p-ha.  P,  proPre2p-ha;  m,  mature  Pre2p-ha; 
asterisks  mark  trypsin  digestion  products. 


In  the  umpi  A  Mutant,  the  Propeptide  of  Pre2p 
is  Not  Required  for  incorporation 
of  Pre2p  Into  the  Proteasome 
Chen  and  Hochstrasser  (1996)  have  elegantly  demon¬ 
strated  that  the  propeptide  of  Pre2p,  if  separated  from 
the  mature  Pre2p,  could  function  in  irons  and  thereby 
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Figure  5.  The  Pre2p  Propeptide  Is  Not  Essentia!  in  the  umpA  Mutant 

(A)  Representation  of  different  genotypes  of  a  set  of  congenic  strains 
and  their  effects  on  cell  viability.  Strain  1  (JD50.1  OB)  is  the  wt  control. 
Strain  2  (MHY952)  expresses  Pre2p/Doa3p-ALS  with  N-temiinal  Thr 
(T)  as  a  fusion  to  ubiqurtin  (not  shown),  and  the  Pre2p/Doa3p  propep¬ 
tide  (LS)  from  two  separate  plasmids  (Chen  and  Hochstrasser,  1 996). 
Strain  3  (JD160),  same  as  strain  2  but  umpll.  Strain  4  would  be 
derived  from  strain  2  through  the  loss  of  the  plasmid  encoding 
Pre2p/Doa3p-LS  that  operates  in  frans.  We  were  unable  to  produce 
isolates  that  lost  this  plasmid,  confirming  the  earlier  demonstration 
that  the  propeptide  is  essential  for  cell  viability  (Chen  and  Hochstras¬ 
ser,  1996).  Strain  5  (JD163)  Is  derived  from  strain  3  through  the  loss 
of  the  propeptide-encoding  plasmid.  Loss  of  the  plasmid  did  not 

affect  cell  viability  in  the  omptA  background. 

(B)  Induction  of  UMP1  expression  in  the  omptA  background  Inhibits 
the  growth  of  a  strain  that  lacks  the  Pre2p  propeptide.  Strains  3 
(plus)  and  5  (minus)  shown  in  (A)  were  transformed  with  pJD429  that 
expressed  UMP1  from  the  galactose-inducible,  glucose-repressible 
Pgali  promoter,  or  with  an  empty  vector  as  a  control.  Transformants 
selected  on  glucose  media  were  pregrown  on  selective  media  con¬ 
taining  raffinose  as  a  carbon  source,  then  streaked  onto  selective 
media  vrith  2%  galactose  or  2%  glucose,  and  grown  for  3  days  at 
30“C. 


Still  allow  the  incorporation  of  Pre2p  into  the  20S  protea¬ 
some.  Under  these  conditions,  expression  of  the  pro¬ 
peptide  is  essential  for  cell  viability,  suggesting  that  the 
propeptide,  in  c/s  or  at  least  in  frans,  is  required  for  the 
assembly  of  an  active  proteasome  (Chen  and  Hochstras¬ 
ser,  1996). 

We  confirmed  this  result  using  their  strain  MHY952, 
in  which  mature  Pre2p  and  the  propeptide  are  expressed 
on  separate  plasmids  (Figure  6A),  Specifically,  under 
nonselective  growth  conditions  this  strain  did  not  yield 
viable  cells  that  had  lost  the  plasmid  expressing  the 


propeptide.  Surprisingly,  however,  when  we  construct^ 
and  examined  a  congenic  umptA  denvative  of  this 
strain,  we  noticed  that  it  lost  the  propeptide-expressing 
plasmid  at  a  frequency  suggesting  that  this 
not  provide  a  significant  growth  advantage  to  cells  (data 
not  shown).  Indeed,  growth  rates  of  cells  with  the  plas¬ 
mid  were  indistinguishable  from  those  lacking  it  (Figure 
5B).  This  result  demonstrated  that  ompf  A  is  a  suppres¬ 
sor  of  a  deletion  of  the  propeptide  of  Pre2p.  If  so,  expres¬ 
sion  of  UMP1  in  the  {ump1^  PRE2-^pro)  background 
should  be  incompatible  with  cell  viability.  This  prediction 
was  confirmed  when  we  examined  growrth  properties 
of  the  {,ump1^  PPE2-i4>ro)  strain  transformed  with  a 
plasmid  expressing  UMP1  from  the  galactose-induci¬ 
ble  Pau.1  promoter.  On  glucose-containing  media,  the 
growth  rate  of  this  transfonaant  was  indistinguishable 
frpm  that  of  an  othenwise  identical  transformant  carrying 
the  propeptide-expressing  plasmid.  In  contrast,  on  ga¬ 
lactose-containing  media,  the  (umplA  PRE2~^pro)  mu¬ 
tant  containing  the  Poj^rUMPI  plasmid  was  unable  to 
grow,  whereas  the  control  strains  grew  (Figure  5B).  This 
result  indicated  that  the  propeptide  of  Pre2p  is  essential 
for  the  Umpip-assisted  maturation  of  the  proteasome 
but  is  not  essential  for  (partially)  defective  maturation 
of  the  proteasome  that  takes  place  in  the  absence  of 
Umptp. 


Discussion 

Ump1p,  a  Novel  Maturation  Factor 
of  the  20S  Proteasome 

We  describe  the  discovery  of  a  proteasome  maturation 
factor,  termed  Umpi  p,  vrhose  unique  properties  include 
a  short  in  vivo  half-life  that  is  due  to  fts  d^radatton 
within  the  newly  formed  proteasome.  Umpi  p  is  required 
for  coordination  of  the  proteasome’s  physical  assembly 
and  enzymatic  activation.  In  addition,  the  normally  es¬ 
sential  propeptide  of  the  Pre2p  ^  subunit  was  found 
to  become  nonessential  in  the  absence  of  Umpip.  We 
report  the  following  specific  results, 
umpi  A  Mutants  Are  Defective  in 
Ub-Mediated  Proteolysis 

They  are  sensitive  to  a  variety  of  stresses  and  accu¬ 
mulate  Ub-protein  conjugates  (Figure  1  and  data  not 
shown).  All  of  the  observed  phenotypes  of  umpi  A  cells 
are  consistent  with  the  conclusion  that  umpi  mutants 
are  impaired  in  proteasome  biogenesis  and,  conse¬ 
quently,  in  the  degradation  of  ubiquitylated  proteins. 
The  UMPI  Gene 

It  encodes  a  polypeptide  with  a  calculated  molecular 
mass  of  16.8  kDa.  No  close  sequence  homologs  of 
Umpip  were  detected  in  the  current  databases.  How¬ 
ever,  the  presence  of  a  small  protein  similar  in  size  to 
Umpip  in  preparations  of  the  half-proteasome  precur¬ 
sors  in  mammals  (Frentzel  et  al.,  1 994;  Yang  et  al.,  1 995; 
Nandi  et  al.,  1 997;  Schmidtke  et  al.,  1 997)  suggests  the 
presence  of  a  functional  homolog  of  Umpi  p  in  the  mam¬ 
malian  proteasome  maturation  pathway. 

Umpip  Is  a  Component  of  Proteasome  Precursors 
that  Contain  Unprocessed  p  Subunits 
The  Umpip-containing  complex  has  a  molecular  mass 
of  300-400  kDa,  sediments  at  ~15S,  migrates  signifi¬ 
cantly  faster  in  native  gels  than  the  20S  proteasome. 
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D  Figure  6.  A  Model  of  the  Ump1  p  Function  in 

Proteasome  Maturation 


Shown  is  a  schematic  view  of  the  20S  protea- 
some  and  its  precursor  forms*  with  the  a  and 
3  subunits  as  blue  and  green  balls,  respec¬ 
tively.  Three  of  the  3  subunits  are  drawn  with 
extensions  that  represent  propeptides.  The  3 
subunits  in  the  front  are  drawn  transparent  in 
order  to  allow  a  view  into  the  interior  chamber 
of  the  proteasome.  Structure  A  is  a  protea- 
some  precursor  complex  (half  proteasome), 
characterized  by  the  presence  of  Ump1  p  and 
unprocessed  3  subunits.  In  step  1,  two  of 
these  precursors  join  to  build  structure  B,  a 
step  that  leads  to  conformational  or  posi¬ 
tional  shifts  of  Ump1  p  and  propeptides.  Con¬ 
formational  changes  of  the  propeptides  trig¬ 
ger  their  autocatalytic  processing  (step  2), 


and  activation  of  the  proteasome^s  proteolytic  activities.  This  leads  to  the  degradation  of  the  chamber-entrapped  Umpi  p  by  the  newly  formed 
proteasome  (step  3).  Only  structures  A  and  D  are  long-lived  enough  to  be  detected  in  wt  cells.  The  findings  with  the  pre7-T  mutant,  m  which 
the  degradation  of  Ump1p  is  inhibited,  suggest  the  existence  of  the  short-lived  intermediates  B  and  C.  See  text  for  details. 


and  has  a  subunit  composition  that  is  highly  similar  to 
that  of  the  20S  proteasome,  as  determined  by  SDS- 
PAGE  and  immunoblot  analyses  with  antl-20S  protea¬ 
some  antibodies  (Figure  2  and  data  not  shown).  Taken 
together,  these  results  indicate  that  the  Ump1p  is  con¬ 
tained  within  a  half-proteasome  precursor  complex, 
whose  counterpart  has  been  described  in  the  maturation 
pathway  of  mammalian  proteasomes  (Frentzel  et  al., 
1994;  Yang  et  al.,  1995;  Nandi  et  al.,  1997;  Schmidtke 
etal.,  1997). 

Maturation  of  Pupip,  Pre2p,  and  Pre3p  is  Strongly 
Impaired  in  the  ump1A  Mutant 
Considerably  increased  amounts  of  the  unprocessed 
(propeptide-containing)  forms  of  these  p  subunits  were 
detected  In  the  20S  and  26S  fractions  from  umpi  A  cells 
(Figure  3).  In  addition,  the  proteasomes  assembled  In 
umpl^  cells  had  reduced  activity.  Thus,  Umpip  is  re¬ 
quired  for  the  correct  and  efficient  maturation  of  the 
proteasomes. 

Partially  Processed  Forms  of  Pro~Pre2p  Are  Detected 
in  the  Proteasome  Precursor  Complex  and  the 
Proteasome  of  umpi  A  Cells  but  Not  in  WT  Cells 
These  incompletely  and  prematurely  processed  forms 
of  Pre2p  (Figures  3A  and  3B)  appear  to  underlie  a  chy- 
motrypsin-like  activity  that  was  associated  with  the  pre¬ 
cursor  complex  in  umpi  A  cells  but  was  absent  from  the 
same  complex  of  wt  cells  (Figure  3C).  This  was  sug¬ 
gested  by  the  observation  that  this  activity  was  not  in¬ 
hibited  by  treatment  with  lactacystin  and  was  absent 
from  cells  lacking  the  Pre2p  propeptide  (P.  C.  R.  and 
R.  J.  D.,  unpublished  data).  These  findings  indicated 
that  Umpip  has  a  dual  rolefn  proteasome  maturation. 
Specifically,  Umpip  prevents  premature  processing  of 
proPre2p  in  the  precursor  complex  and  is  also  required 
for  the  correct  maturation  of  active  sites  upon  assembly 
of  the  proteasome  (see  below). 

Umpip  Is  Degraded  by  the  Newly 
Formed  Proteasome 

The  kinetics  of  the  rapid  degradation  of  Umpi  p  is  similar 
to  the  kinetics  of  processing  of  Pupl  p  (Figures  3B  and 
4A),  suggesting  that  Umpi  p  is  destroyed  by  the  protea¬ 
some  upon  Its  formation  from  the  two  half-proteasome 
precursors  and  the  accompanying  maturation  of  active 


sites  (Chen  and  Hochstrasser,  1996).  This  conclusion 
was  strongly  supported  by  the  observation  that  Umpi  p 
is  significantly  stabilized  in  the  pret-f  mutant,  which  Is 
deficient  in  the  proteasome’s  chymotrypsin-like  activity 
(Figure  4A),  Specifically,  In  contrast  to  the  pattern  in  wt 
cells,  Umpi  p  was  detectable  in  the  20S  and  26S  forms 
of  the  pret-t  proteasome.  Umpip  in  these  complexes 
was  shielded  both  from  detection  by  antibodies  and 
from  digestion  by  trypsin  (Figures  4B-4D).  These  and 
related  data  suggested  a  model  in  which  Umpip  is  en¬ 
cased  within  the  20S  proteasome  upon  its  assembly 
from  the  two  Umpip-containing  half-proteasome  pre¬ 
cursor  complexes  (Figure  6).  Formation  of  the  20S  struc¬ 
ture  triggers  active  site  maturation,  resulting  in  the  deg¬ 
radation  of  Umpip. 

The  Propeptide  of  Pre2p  Is  Not  Essential 
in  umplA  Cells 

Chen  and  Hochstrasser  (1996)  demonstrated  that  the 
propeptide  of  Pre2p  is  essential  for  cell  viability  and  that 
this  propeptide  can  operate  in  trans.  They  concluded 
that  the  propeptide,  in  addition  to  rendering  proteasome 
precursors  proteolytically  inactive,  serves  a  chaperone¬ 
like  function  required  for  the  correct  incorporation  of 
Pre2p  into  the  proteasome.  One  striking  result  of  the 
present  work  is  that  the  propeptide  of  Pre2p  becomes 
dispensable  In  the  umpi  A  mutant  (Figure  5).  In  fact, 
the  presence  of  the  propeptide,  which  is  essential  for 
viability  of  wt  cells,  does  not  provide  a  growth  advantage 
to  umpi  A  cells  that  express  it. 

On  the  Functions  of  Umpip  and  p  Subunit 
Propeptides  in  Proteasome  Maturation 
Our  results  suggested  a  model  Illustrated  in  Figure  6. 
This  model  is  based  in  part  on  the  idea  described  by 
Chen  and  Hochstrasser  (1996)— that  the  active  sites  of 
the  proteasome  are  formed  upon  its  assembly  from  two 
half-proteasome  precursors,  through  a  juxtaposition  of 
subunits  at  the  interface  of  the  dyad-related  halves.  The 
advantage  of  this  mechanism  is  that  autocatalytic  matu¬ 
ration  of  the  active  sites  (through  processing  of  the  rele¬ 
vant  p  subunits)  is  coupled  to  the  assembly  of  the  pro¬ 
teasome,  thus  avoiding  premature  processing  of  the 
propeptides  of  p  subunits.  This  view  was  supported  by 
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Table  1 .  Yeast  Strains 


Strain 

YPH500 

JDC9-11 

JD47-13C 

JD59 

JD53 

JD81-1A 

JD75 

JD126 

JD127 

JD129 

JD131 

J0132 

JD133 

JD134 

JD135 

JD136 

JDt38 

JD139 

BBY45 

JD61 

JD50-10B 

MHY952 

JD160 

JD163 

WCG4a 

YHI29/1 

JD150 

JD151 


Relevant  Genotype _ _ _ _ 

MATu  ade2-101  his3-^00  Ieu2-S1  Iys2-801  f/pt-A63  ura3-52 
MATa  ump1-1 

MATa  his3-:^00  Ieu2-3,112  ty^-801  trp1-S63  ura3^52 

MATa  umpUM::HtS3 

MATa 

MATa  ump1-M::HIS3 

MATa  PREUha::Ylplac21 1  ump1-M::HtS3 

MATa  UMPUha::Ylplac128  PRE1-Flag-6His::Ylptac211 

MATa  UMP1-ha::Ylplac128  PRE1-ha::Ylplac211 

MATa  UMP1’ha::Ylplac128 

MATa  UMP1-ha::Ylplac128  PUPUha::Ytpiac211 

MATa  ump1’M::HIS3  PUPUha::Ylplac211 

MATa  UMPUha::Ylplac128  PRE2-ha::Ylplac211 

MATa  ump1-M::HiS3  PRE2’ha::Yiplac211 

MATa  UMPUha::Ylplac128  PRE3-ha::Ylpiac211 

MATa  umpU11::HIS3  PRE3-ha::Ytptac21 1 

MATa  PRE2-ha::Ylptac21 1 

MATa  PUP1-ha::Ylplac211 

MATa  his3'^200  Ieu2-3,112  tys2-801  trp1-1  ura3-52 
MATa/MATa  ump1-M::HIS3/umpUS2::LEU2 
MATa  Ieu2‘3,112  his3-1200  trp1-1  ura3‘52 
MATa  Ieu2‘3.112  his3-S200  trp1-1  ura3-52 
pre2(=doa3)M::HIS3  (YCpUbD0A3ALS-His)  (YEpDOA3us) 
MATa  wmp7-A3 

pre2(^doa3)M::HtS3  (YCpUbD0A3ALS-His)  (YEpDOASts) 
MATa  umpt-A3 

pre2(=doa3)-M::HIS3  (YCpUbDOA3ALS-His) 

MATa  ura3  his3‘11  Ieu2’3,112 
MATa  pret-7  ura3  hi$3-11  Ieu2’3,112 
MATa  UMPUha::Ytplac128 

MATa  pre1-1  UMP1-ha::Ylplac128 _ _ 


Source/Comment 

Sikorski  and  Hieter,  1989 
Derivative  of  YPH500 
Dohmen  et  al..  1995 
Derivative  of  JD47-1 3C 
Derivative  of  JD47-13C 
Derivative  of  JD53 
Derivative  of  J  047-1 3C 
Derivative  of  JD47-1 3C 
Derivative  of  JD71 
Derivative  of  JD1 27 
Derivative  of  JD1 29 
Derivative  of  JD59 
Derivative  of  JD129 
Derivative  of  JD59 
Derivative  of  JD1 29 
Derivative  of  JD59 
Derivative  of  JD47-1 3C 
Derivative  of  JD47-1 3C 
Bartel  et  al..  1990 
Congenic  with  BBY45 
Congenic  with  BBY45 
Chen  and  Hochstrasser,  1996, 
congenic  with  BBY45 
Derivative  of  MHY952 

Derivative  of  JD1 60, 
cured  of  YEpDOA3i.s 
Heinemeyer  et  al.,  1 993 
Heinemeyer  et  al.,  1993 
Derivative  of  WCG4a 
Derivative  of  YH1 29/1 


biochemical  analyses  of  the  proteasome  maturation  in 
mammalian  cells  (Frentzel  et  al.,  1994;  Yang  et  al.,  1995; 
Nandi  et  al.,  1997;  Schmidtke  et  al.,  1997). 

The  results  of  the  present  work  further  support  this 
model.  In  addition,  we  discovered  that  a  novel  factor, 
Umpi  p,  is  required  for  autocatalytic  active  site  matura¬ 
tion  and  proteasome  assembly  to  occur  in  a  coordinated 
fashion.  A  model  that  accounts  for  our  findings  and  is 
consistent  with  the  available  evidence  is  illustrated  in 
Figure  6.  In  this  model,  Ump1p  has  a  chaperone-like 
function  required  for  the  efficient  processing  of  the  pro¬ 
peptides  of  p  subunits  upon  the  assembly  of  20S  protea- 
somes.  Specifically,  Umpip  Interacts  with  these  p  sub¬ 
units,  most  likely  with  their  propeptides  (structure  A  in 
Figure  6).  Upon  formation  of  the  20S  proteasome  (Figure 
6,  step  1),  a  conformational  change  of  Umpip,  or  a 
change  of  its  position  within  the  complex,  would  induce 
a  conformational  change  of  the  propeptide  that  results 
in  autocatalytic  processing  (Figure  6,  step  2). 

Why  is  the  propeptide  of  Pre2p  essential  for  protea¬ 
some  assembly  and  activation  only  in  the  presence  of 
Umpip?  We  propose  that  the  propeptide  of  Pre2p  is 
required  to  induce  an  alteration  of  conformation  or  posi¬ 
tion  of  Umpi  p  upon  the  assembly  of  the  20S  particle. 
In  this  model,  the  absence  of  the  propeptide  would  leave 
Umpi  p  in  a  position  that  is  incompatible  with  the  forma¬ 
tion  of  active  proteasome.  It  is  possible  (and  remains 
to  be  verified)  that  this  chaperone-like  function  of  a  pro¬ 
peptide  is  unique  to  Pre2p,  as  It  contains  a  much  longer 
propeptide  (75  residues)  than  the  other  p  subunits. 

In  the  model  of  Figure  6,  Umpip  is  a  metabolically 


unstable  chaperone  that  is  required  for  the  proper  (and 
properly  timed)  processing  of  p  subunits  upon  the  as¬ 
sembly  of  the  20S  proteasome  and  that  is  destroyed 
within  the  newly  activated  proteasome.  The  unusual  me¬ 
chanics  of  Umpi  p,  its  noncatalytic  mode  of  action,  and 
Its  degradation  by  the  protease  it  helps  to  activate  char¬ 
acterize  Ump1  p  as  a  novel  type  of  molecular  chaperone. 

Experimental  Procedures 
Yeast  Media 

Yeast  rich  (YPD)  and  synthetic  (S)  minimal  media  with  2%  dextrose 
(SD)  or  2%  galactose  (SG)  were  prepared  as  described  (Dohmen  et 
ai.,  1995). 

Isolation  of  the  UMPI  Gene 

UMPI  was  identified  using  a  selection-based  screen  for  mutants  in 
the  N-end  rule  pathway  (reviewed  by  Varshavsky,  1 996).  S.  cerews- 
iae  strain  YPH500  (Table  1)  was  transformed  with  two  plasmids: 
pJD205,  which  expressed  a  Ura3p-based  N-end  rule  reporter  sub¬ 
strate  (Arg-Tpi1p-Ura3p),  and  pRL2.  which  expressed  Arg-p-gal, 
another  N-end  rule  reporter  substrate  that  can  be  monitored  using 
X-Gal  plate  assays  (Baker  and  Varshavsky.  1995).  Previous  work 
(Dohmen  et  al.,  1994)  has  shown  that  rapid  degradation  of  a  Ura3- 
based  N-end  rule  substrate  renders  the  cells  Ura" ,  whereas  mutants 
in  the  N-end  rule  pathway  that  express  the  same  Ura3p-based  re¬ 
porter  are  Ura\  In  this  screen,  Ura*  isolates  were  selected  on  SD 
media  lacking  uracil  and  were  then  tested  on  X-Gal  plates  to  verify 
that  the  same  isolates  were  also  defective  In  the  degradation  of  the 
Arg-3-gal  N-end  rule  substrate.  One  of  the  mutants  thus  identified 
(umpT-f)  was  cured  of  pJD205  and  transformed  with  a  genomic 
yeast  library  (Rose  et  at.,  1987).  Six  transfonnants  yielded  plasmids 
with  four  overlapping  inserts  that  could  restore  the  ability  of  colls 
to  degrade  Arg-p-gal.  A 1981  bp  BamHI/Sau3A  fragment  common 
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to  all  of  the  inserts  was  sequenced  and  found  to  contain  one  com¬ 
plete  ORF  and  two  flanking  incomplete  ORFs.  Further  mapping, 
using  subclonrng  and  PCR,  confirmed  that  the  complete  ORF  of  445 
bp  was  responsible  for  the  complementation.  The  UMP1  sequence 
(EMBL  database  accession  number:  AJ002557)  is  identical  to  ORF 
YBR173C.  an  ORF  subsequently  identified  by  the  yeast  genome 
project 

Construction  of  Yeast  Strains  and  Plasmids 
Table  1  lists  the  strains  used  in  this  study.  To  construct  umpfA 
alleles,  the  1981  bp  fragment  described  above  was  subcloned  Into 
M13mp19.  Using  single-stranded  DNA  of  the  resulting  phage,  a 
synthetic  oligonucleotide  and  T4  DNA  polymerase,  the  UMP1  ORF 
was  precisely  deleted  and  replaced  by  a  Bglll  restriction  site.  The 
resulting  fragment  was  subcloned  into  pUC19,  and  the  Bglll  site 
was  used  to  Insert  fragments  containing  the  HtS3,  LEU2,  or  the 
URA3  gene,  the  latter  one  being  flanked  by  two  direct  repeats  of  a 
segment  of  the  £  coli  hisG  gene  (Alani  et  al.,  1987).  The  resulting 
deletion  alleles  {umpUM::HIS3,  umpU12::LEU2,  and  umpU 
A3::l/RA3)  were  isolated  as  BamHI  fragments,  introduced  Into  S. 
cereWs/ae,  and  used  to  delete  the  UMP1  gene  (Rothstein,  1991). 
umpIS  strains  carrying  an  unmarked  ump1S3  allele  (resulting  from 
recombination  between  the  hisG  repeats)  were  selected  on  plates 
containing  5-fluoroorotic  acid  (Alani  et  al.,  1987).  Construction  of 
chromosomal  ORFs  that  expressed  C  terminally  tagged  versions  of 
Umpip  or  proteasome  subunits  (Prelp,  Pre2p,  Pre3p,  and  Pupip) 
instead  of  their  wt  counterparts  was  performed  as  follows.  Using 
primers  that  contained  flanking  EcoRI  and  Kpnl  sites.  3'  portions 
of  the  respective  genes  were  amplified  by  PCR.  These  sites  were 
then  used  to  insert  the  amplified  fragments  into  integrative  plasmids 
based  on  Ylplac128  iLEU2  marked)  or  Ylplac211  {URA3  marked) 
(Gietz  and  Sugino.  1988)  that  contained  sequences  encoding  epi¬ 
tope  tags  followed  by  the  terminator  sequence  of  the  CYCI  gene 
(Tcycf)*  Each  of  the  resulting  plasmids  was  linearized  within  the  cod¬ 
ing  sequence  for  targeted  integration  into  the  S.  cerevisiae  genome, 
yielding  strains  with  one  copy  of  the  respective  gene  (fused  in-frame 
to  the  tag-coding  sequence)  expressed  from  its  natural  promoter 
and  Tcvcft  in  addition  to  a  3'  portion  of  the  same  gene  without 
promoter.  The  epitope  tags  used  were  double  ha  (*‘ha’*)  and  Flag- 
HISs  (“FH")  (Dohmen  et  al.,  1995).  Plasmid  pJD429  {CEN/URA3)  ex¬ 
pressing  a  UMP1  cDNA  from  PGAL1  was  isolated  from  a  cDNA 
library  (Liu  et  al.,  1992)  through  complementation  of  the  umpll 
mutation. 

Pulse-Chase  Analyses 

R-$-gal  (Ub-R-p-gal)  and  Ub-P-3-gal  (Bachmair  et  al.,  1986)  were 
expressed  in  the  MATa  strains  JD47-13C  or  JD59.  MATa2,.Mrp-gal 
was  expressed  from  the  plasmid  YCp50-a2d,Bi<-3-gal  (Hochstrasser 
and  Varshavsky,  1 990)  in  the  MATa  strains  JD53  and  JD81  -1 A  (Table 
1).  Pulse  labeling  for  5  min  with  Redivue  Promix  pS]  (Amersham) 
followed  by  a  chase  and  immunoprecipitation  with  monoclonal 
anH>6-gat  (Promega)  or  anti-ha  (16B12,  Babco)  antibodies  were 
carried  out  as  described  by  Dohmen  et  al.  (1991).  ^  proteins  frac¬ 
tionated  by  SDS-PAGE  were  detected  by  fluorography. 

Fractionation  of  Whole-Cell  Extracts  by  Gel  Filtration 
S.  cerevisiae  were  grown  at  3(rC  in  YPD  or  in  SD  media  to  ODwo  of 
1.4  *  0.1,  harvested  at  3000g,  washed  with  cold  water,  frozen  in 
liquid  nitrogen,  and  stored  at  -80°C.  Cell  paste  was  ground  to 
powder  in  a  mortar  in  the  presence  of  liquid  nitrogen.  The  extraction 
buffer  was  50  mM  Tris-HCI  (pH  7.5),  2  mM  ATP,  5  mM  MgCIs,  1  mM 
DTT,  15%  glycerol,  used  at  2  ml  per  gram  of  pelleted  yeast  cells. 
After  centrifugation  at  31,000g  for  10  min  at  2*'C,  the  supernatant 
was  subjected  to  a  second  centrifugation  at  60,000g  for  30  min. 
yielding  an  extract  with  the  protein  concentration  of  ^5  mg/ml.  The 
protein  concentration  of  extracts  was  equalized  in  parallel  experi¬ 
ments,  using  the  extraction  buffer.  Using  the  FPLC  system  (Phar¬ 
macia).  200  M-t  samples  of  an  extract  were  chromatographed  on  a 
Superose-6  column  equilibrated  with  extraction  buffer.  The  flow 
rate  was  0.3  ml/min  and  fractions  of  0.6  ml  were  collected.  The 
Superose-6  column  was  calibrated  using  the  following  standards: 
thyroglobulin  (669  kDa),  ferritin  (443  kDa),  and  bovine  senim  albumin 
(66  kDa).  Dextran  blue  was  used  to  monitor  the  void  volume. 


Assays  for  Proteolytic  Activities  with  Ruorogenic 
Peptide  Substrates 

To  determine  the  chymotrypsin-like  activity.  20  ^l  of  the  protein 
fraction  and  20  m-I  of  0.5  mM  succinyl-Leu-Leu-VaI-Tyr-7-amido-4- 
methylcoumarin  in  50  mM  Tris-HCI  (pH  7.8),  2  mM  ATP.  5  mM  MgClj, 
and  1  mM  DTT  were  mixed  and  incubated  for  1  hr  at  37®C.  The 
reaction  was  stopped  by  addition  of  960  fil  of  cold  ethanol,  and 
the  fluorescence  was  measured  at  440  nm,  using  the  excitation 
wavelength  of  380  nm.  The  trypsin-like  activity  and  the  peptidylglu- 
tamyl  peptide-hydrolyzing  activity  were  determined  with,  respec¬ 
tively.  Cbz-Ala-Ala-Arg-4MeO-p-naphthylamide  and  Cbz-Leu-Leu- 
Glu-p-naphthylamide  as  fluorogenic  peptide  substrates  (Fischer  et 
al.,  1994),  except  that  the  volume  of  the  protein  fraction  was  75  m-I, 
the  reactions  were  stopped  after  270  min  at  37X,  and  the  fluores¬ 
cence  emission  was  measured  at  366  nm,  using  an  excitation  wave¬ 
length  of  420  nm.  One  unit  (U)  of  a  proteolytic  activity  is  defined  as 
1  ftmol  of  the  fluorophore  produced  per  min  under  these  conditions. 

Immunoprecipitation,  Electrophoresis,  and  Immunoblotting 
Immunoprecipitations  were  carried  out  using  either  monoclonal  anti¬ 
tag  antibodies  (see  below),  or  polyclonal  rabbit  antisera  raised 
against  the  yeast  20S  proteasome  (a  gift  from  K.  Tanaka),  or  against 
UmpIp-HiSe  expressed  in  £  coli.  For  immunoblotting,  the  protein 
samples  were  boiled  for  3  min  in  the  presence  of  2%  SDS  and 
0.1  M  2-mercaptoethanol,  then  subjected  to  12%  SDS-PAGE,  and 
thereafter  transferred  onto  a  PVDF  membrane  (Miliipore)  in  a  wet 
blot  system  (BioRad).  The  blots  were  incubated  with  either  rabbit 
anti-ubiquitin  (Ramos  et  al.,  1995),  or  anti-Cim3p  (Ghislain  et  al.. 
1993)  polyclonal  antibodies,  or  16B12  anti-ha  (BabCo),  or  M2  anti- 
Flag  (Kodak)  monoclonal  antibodies,  and  were  processed  as  de¬ 
scribed  (Ramos  et  al.,  1 995),  except  that  the  initially  blotted  proteins 
were  visualized  using  horseradish  peroxidase-conjugated  goat  anti¬ 
mouse  or  anti-rabbit  IgGs,  the  chemiluminescence  blotting  sub¬ 
strate  detection  system  from  Boehringer  Mannheim,  and  X-ray  films. 
Nondenaturing  4.5%  acrylamide  gel  electrophoresis  was  performed 
as  described  by  Hough  et  al.  (1 987).  and  the  gels  were  incubated  for 
1 5  min  in  transfer  buffer  containing  0.1  %  SDS  before  electrotransfer. 

Affinity  Purification  of  Proteasomes  and  Related  Complexes 
The  20S  proteasome  and  its  precursors  were  purified  from  strain 
JD126  that  expressed  Prelp-FH  and  Umpip-ha.  Crude  extracts 
were  prepared  as  described  above,  followed  by  an  exchange  of 
buffer  to  50  mM  Na-phosphate  (pH  8.0),  0.3  M  NaCI,  using  PD-10 
columns  (Pharmacia).  The  extract  was  then  incubated  with  Ni-NTA 
agarose  (Qiagen)  in  batch  for  2  hr  at  4''C,  followed  by  washings  and 
elution  according  to  the  manufacturer’s  protocol,  except  that  the 
(pH  6.0)  washing  buffer  contained  20  mM  imidazole.  Hise-tagged 
proteins  were  eluted  with  a  step  gradient  of  100-500  mM  imidazole. 
Active  fractions  containing  a  mixture  of  the  mature  proteasome  and 
its  precursors  were  pooled,  diluted  2-fold  with  50  mM  Tris-HCI  (pH 
7.5),  and  incubated  for  2  hr  at  4‘’C  in  batch  with  0.5  ml  anti-Flag  M2 
antibody  affinity  resin  (IBI/Eastman  Kodak)  that  had  been  equili¬ 
brated  in  TN  buffer  (0.15  M  NaCI,  50  mM  Tris-HCI.  [pH  7.5D.  After 
the  loading,  the  resin  was  washed  with  TN  buffer.  Flag-tagged  pro¬ 
teins  were  specifically  elutekf  with  the  Flag  epitofse  i^eptide  (IBl/ 
Eastman  Kodak). 
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ABSTRACT  The  N-end  rule  relates  the  in  vivo  half-Iife  of  a 
protein  to  the  identity  of  its  N-terminal  residue.  The  N-end  rule 
pathway  is  one  proteolytic  pathway  of  the  ubiquitin  system.  The 
recognition  component  of  this  pathway,  called  N-recognin  or  £3, 
binds  to  a  destabilizing  N-terminal  residue  of  a  substrate  protein 
and  participates  in  the  formation  of  a  substrate-linked  multiu- 
biquitin  chain.  We  report  the  cloning  of  the  mouse  and  human 
Ubrl  cDNAs  and  genes  that  encode  a  mammalian  N-recognin 
called  E3a.  Mouse  UBRlp  {E3a)  is  a  1,757-residue  (200-kDa) 
protein  that  contains  regions  of  sequence  similarity  to  the 
225-kDa  Ubrlp  of  the  yeast  Saccharomyces  cerevisiae.  Mouse  and 
human  UBRlp  have  apparent  homologs  in  other  eukaryotes  as 
well,  thus  defining  a  distinct  family  of  proteins,  the  UBR  family. 
The  residues  essential  for  substrate  recognition  by  the  yeast 
Ubrlp  are  conserved  in  the  mouse  UBRlp.  The  regions  of 
similarity  among  the  UBR  family  members  include  a  putative 
zinc  finger  and  R1NG-H2  finger,  another  zinc-binding  domain. 
Ubrl  is  located  in  the  middle  of  mouse  chromosome  2  and  in  the 
syntenic  15ql5-q21.1  region  of  human  chromosome  15.  Mouse 
Ubrl  spans  «^120  kilobases  of  genomic  DNA  and  contains  ^50 
exons.  Ubrl  is  ubiquitously  expressed  in  adults,  with  skeletal 
muscle  and  heart  being  the  sites  of  highest  expression.  In  mouse 
embryos,  the  Ubrl  expression  is  highest  in  the  branchial  arches 
and  in  the  tail  and  limb  buds.  The  cloning  of  Ubrl  makes  possible 
the  construction  of  -lacking  mouse  strains,  a  prerequisite 
for  the  functional  understanding  of  the  mammalian  N-end  rule 
pathway. 


A  number  of  regulatory  circuits  involve  metabolically  unstable 
proteins.  Short  in  vivo  half-lives  are  also  characteristic  of  damaged 
or  otherwise  abnormal  proteins  (1-4).  Features  of  proteins  that 
confer  metabolic  instability  are  called  degradation  signals,  or 
degrons.  The  essential  component  of  one  degradation  signal, 
called  the  N-degron,  is  a  destabilizing  N-terminal  residue  of  a 
protein  (5, 6).  The  set  of  amino  acid  residues  that  are  destabilizing 
in  a  given  cell  type  yields  a  rule,  called  the  N-end  rule,  which 
relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its 
N-terminal  residue.  Similar,  but  distinct,  versions  of  the  N-end 
rule  pathway  are  present  in  all  organisms  examined,  from  mam¬ 
mals  to  fungi  and  bacteria  (6-8). 

In  eukaryotes,  the  N-degron  comprises  two  determinants:  a 
destabilizing  N-terminal  residue  and  an  internal  lysine  or  lysines 
(8).  The  Lys  residue  is  the  site  of  formation  of  a  multiubiquitin 
chain  (9).  The  N-end  rule  pathway  is  thus  one  pathway  of  the 
ubiquitin  (Ub)  system.  Ub  is  a  76-residue  protein  whose  covalent 
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conjugation  to  other  protcin.s  plays  a  role  in  a  multitude  of 
processes,  including  cell  growth,  division,  differentiation,  and 
responses  to  stress  (1,3, 4, 10).  In  most  of  the.sc  processes,  Ub  acts 
through  routes  that  involve  the  degradation  of  Ub-protcin  con¬ 
jugates  by  the  26S  protcasomc,  an  ATP-dependent  multisiibunit 
protease  (11). 

The  N-end  rule  is  organized  hierarchically.  In  the  yeast  Sac- 
chciromyces  cerevisiae,  Asn  and  Gin  are  tertiary  destabilizing 
N-terminal  residues  in  that  they  function  through  their  enzymatic 
deamidation  into  the  secondary  destabilizing  N-terminal  residues 
Asp  and  Glu  (12).  The  destabilizing  activity  of  N-tcrminal  Asp 
and  Glu  requires  their  enzymatic  conjugation  to  Arg,  one  of  the 
primary  destabilizing  residues  (6).  The  primary  destabilizing 
N-terminal  residues  arc  bound  directly  by  the  -encoded 
N-recognin  (also  called  E3),  the  recognition  component  of  the 
N-end  rule  pathway  (13).  In  S.  cerevisiae,  N-recognin  is  a  225-kDa 
protein  that  binds  to  potential  N-end  rule  substrates  through  their 
primary  destabilizing  N-tcrminal  residues — Phe,  Leu,  Trp,  Tyr, 
lie,  Arg,  Lys,  and  His.  N-rccognin  has  at  least  two  substrate¬ 
binding  sites.  The  type  1  site  is  specific  for  the  basic  N-tcrminal 
residues  Arg.  Lys,  and  His.  The  type  2  site  is  specific  for  the  bulky 
hydrophobic  N-terminal  residues  Phe,  Leu,  Trp,  Tyr,  and  He  (6). 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  peptide  import  in  S.  cerevisiae  (through  degradation  of 
Cup9p,  a  transcriptional  repressor  of  the  peptide  transporter 
Ptr2p);  a  role  in  controlling  the  SInIp-dependent  phosphoryla¬ 
tion  cascade  that  mediates  osmoregulation  in  S.  cerevisiae\  the 
degradation  of  Gpalp,  a  Ga  protein  of  S.  cerevisiae;  and  the 
degradation  of  alphaviral  RNA  polymerases  in  virus-infected 
metazoan  cells  (6,  14). 

The  mammalian  counterpart  of  the  yeast  -encoded  N- 
recognin  (E3)  was  characterized  biochemically  in  extracts  from 
rabbit  reticulocytes  (15-17).  Rabbit  E3a  was  shown  to  be  spe- 


Abbreviations:  Ub,  ubiquitin:  kb,  kilobase;  id.,  identity;  si.,  similarity; 
BAG,  bacterial  artificial  chromosome;  FISH,  fluorescence  in  situ 
hybridization;  e«,  embryonic  day. 

Data  deposition:  Nucleotide  sequences  reported  in  this  work  have 
been  deposited  in  the  GcnBank  database  [accession  nos.  AF061555 
(mouse  Ubrl  cDNA)  and  AF()61556  (human  UBR]  cDNA)]. 

**To  whom  reprint  requests  should  be  addressed  at:  Division  of 
Biology,  147-75,  Caltech,  1200  East  California  Boulevard,  Pasadena, 
CA  91125.  e-mail:  avarsh@cco.caltech,edu. 
ttThe  names  of  mouse  genes  are  in  italics,  with  the  first  letter 
uppercase.  The  names  of  human  and  5.  cerevisiae  genes  arc  also  in 
italics,  all  uppercase.  If  human  and  mouse  genes  are  named  in  the 
same  sentence,  the  mouse  gene  notation  is  used.  The  names  of  5. 
cerevisiae  proteins  arc  Roman,  with  the  first  letter  uppercase  and  an 
extra  lowercase  “p”  at  the  end.  The  names  of  the  corresponding 
mouse  and  human  proteins  are  the  same,  except  that  all  letters  but 
the  last  “p”  are  uppercase.  The  latter  usage  is  a  modification  of  the 
existing  convention  (33),  to  facilitate  simultaneous  discussions  of 
yeast,  mouse,  and  human  proteins.  In  some  citations,  the  abbrevi¬ 
ated  name  of  a  species  precedes  the  gene’s  name. 
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FKj.  1.  Peptides  of  rabbit  UBRlp  (E3«)  and  isola¬ 
tion  of  the  mouse  Ubrl  cDNA.  (A)  Amino  acid  se¬ 
quences  of  tryptic  peptides  of  the  purified  rabbit 
UBRlp  (see  Materials  and  Methods).  The  alternative 
sets  of  peptide  names,  T-based  and  PEP1-PEP3  (in 
parentheses),  refer  to  two  different  preparations  of 
E3a.  The  sequences  of  T120,  T76,  T96,  and  TI22  that 
were  encoded  by  DNA  sequences  identified  through 
intrapeptidc  PCR  arc  underlined.  Residues  deduced 
from  the  mouse  Vhrl  cDNA  that  differed  from  those 
inferred  through  peptide  sequencing  arc  indicated  in  a 
smaller  font.  The  peptides’  positions  in  the  deduced 
sequence  of  mouse  UBRlp  are  indicated.  (B)  The 
intrapcptidc/interpcptidc-PCR  cloning  strategy.  The 
products  of  the  initial  intrapeptidc  PCR,  derived  from 
rabbit  genomic  DNA,  were  used  to  carry  out  interpep- 
tidc  PCRwith  a  rabbit  liver  cDNA  library  (CLON- 
TECH).  The  resulting  392-bp  fragment  of 


the  rabbit  Ubrl  cDNA  was  used  to  isolate,  using  PCR  and  a  Agtll  mouse  liver  cDNA  library,  the  corresponding  392-bp  mou.se  Ubrl  cDNA  fragment. 


This  fragment  then  was  used  to  screen  the  same  cDNA  library,  yielding  a  2.4-kb  fragment  of  the  mouse  Ubrl  cDNA  that  encoded  several  of  the 
peptide-derived  sequences  of  the  rabbit  UBRlp.  The  encoded  sequence  was  also  significantly  similar  to  that  of  the  N-terminal  region  of  5.  cerevisiae  Ubrlp 
(13)  and  contained  the  putative  start  (ATG)  codon  of  the  mouse  Ubrl  ORE.  To  isolate  the  rest  of  the  5'  region  of  the  Ubrl  cDNA,  5'-rapid  amplification 
of  cDNA  ends  (RACE)-PCR  (20)  was  performed  with  poly(A)'^  RNA  from  mou.se  L  cells  and  a  primer  from  the  2.4-kb  DNA  fragment.  3'-R  ACE-PCR 


(20)  was  u.sed  to  amplify  a  downstream  region  of  Ubrl  cDNA.  The  resulting  DNA  fragment  (nucleotides  2,470-3,467)  then  was  u.sed  to  .screen  a  AgllO 
mouse  cDNA  library  from  MEL-C19  cells.  Five  overlapping  cDNA  isolates  (MR  16,  MR  17,  MR  19,  MR20,  and  MR23)  that  together  spanned  the  entire 
Ubrl  cDNA  were  mapped  and  subcloncd  into  Blucscript  II  SK+  (Stratagene),  yielding  the  plasmid  MR26,  which  contained  the  entire  ORF  of  Ubrl.  The 
ORF  region  of  Ubrl  cDNA  was  sequenced  on  both  strands  at  least  twice,  using  independently  derived  cDNA  clones. 


cifically  required  for  the  Ub-dependent  degradation  of  proteins 
bearing  either  type  1  (basic)  or  type  2  (bulky  hydrophobic) 
destabilizing  N-terminal  residues  (7,  15,  16). 

We  began  dissection  of  the  mouse  N-end  rule  pathway  by 
isolating  the  NtanI  gene,  which  encodes  the  asparagine-specific 
N-terminal  amidase  (18,  19),  a  component  of  the  mammalian 
N-end  rule  pathway,  and  by  constructing  mouse  strains  that  lack 
NtanI  (Y.T.K.  and  A.V.,  unpublished  data).  Herein,  we  describe 
the  cloning  and  characterization  of  the  mouse  and  human  cDNAs 
and  genestt  that  encode  UBRlp  (E3a),  a  homolog  of  yeast 
Ubrlp  and  the  main  recognition  component  of  the  N-end  rule 
pathway. 

MATERIALS  AND  METHODS 

Isolation  and  Partial  Sequencing  of  Mammalian  E3a 
(UBRlp).  Rabbit  E3a  was  purified  from  reticulocyte  extracts  by 
using  affinity  chromatography  with  immobilized  protein  sub¬ 
strates  of  UBRlp  and  elution  with  dipeptides  bearing  destabi¬ 
lizing  N-terminal  residues  (16).  The  resulting  preparation  was 


fractionated  by  SDS/PAGE.  The  band  of  «=^180-kDa  E3a  was 
excised  and  subjected  to  digestion  with  trypsin.  Amino  acid 
sequences  were  determined  for  14  peptides  of  rabbit  UBRlp  (Fig. 
lA)  by  using  standard  methods  (20). 

Isolation  of  the  Full-Length  Mouse  Ubrl  cDNA.  A  strategy  that 
included  the  intrapeptidc-interpeptide  PCR  (21)  was  used  (see 
the  legend  to  Fig.  1). 

Isolation  of  a  Partial  Human  UBRI  cDNA  PoIy(A)^  RNA 
from  human  293  cells  was  subjected  to  reverse  transcription- 
PCR,  using  sets  of  primers  corresponding  to  sequences  of  the 
mouse  Ubrl  cDNA.  One  of  the  reactions  yielded  a  1.0-kilobase 
(kb)  fragment  that  encompassed  a  region  of  the  human  UBRI 
cDNA  (Fig.  2). 

Mouse  and  Human  Genomic  Ubrl  Fragments.  A  library  of 
mouse  genomic  DNA  fragments  (strain  SvJ)  in  bacterial  artificial 
chromosome  (BAC)  (22)  vector  (Genome  Systems,  St.  Louis) 
was  used,  as  described  in  the  legend  to  Fig.  2. 

Northern,  Southern,  and  Whole-Mount  in  Situ  Hybrid¬ 
izations.  Mouse  and  human  multiple-tissue  Northern  blots 
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Fig.  2.  The  mouse  and  human  Ubrl 
cDNAs  and  genes.  Thick  horizontal  lines 
represent  genomic  DNA.  The  upper  one  is 
a  *«31-kb  fragment  of  the  mouse  Ubrl  gene 
that  corresponds  to  a  1.34-kb  region  of 
mouse  Ubrl  cDNA  (nucleotides  115- 
1,454).  Vertical  rectangles  represent  exons. 
Their  lengths,  and  the  lengths  of  the  in- 
trons,  are  indicated,  respectively,  below  and 
above  the  horizontal  line.  In  a  composite 
diagram  of  the  Ubrl  cDNA,  the  exons  are 
depicted  as  alternatively  shaded  rectangles. 
For  exon  1,  only  its  translated  region  is 
indicated.  Shown  belowthe  cDNA 


diagram  is  a  ‘«21-kb  fragment  of  the  human  UBRI  gene,  corresponding  to  1.0  kb  of  the  indicated  region  of  the  human  UBRI  cDNA  (nucleotides 
2,218-3,227  of  the  mouse  Ubrl  cDNA  sequence).  The  mouse  and  human  Ubrl  exons  are  denoted,  respectively,  by  numbers  and  letters.  Also  indicated 
are  the  exon  locations  of  some  of  the  type  1  and  type  2  substrate-binding  sites  of  N-recognin  (the  essential  amino  acid  residues  are  underlined)  (A.  Webster, 
M.  Ghislain,  and  A.V.,  unpublished  data;  see  the  main  text).  Not  shown  are  the  114-bp  5 '-untranslated  region  (UTR)  and  the  1,010  bp  3'-UTR  of  the 
mouse  Ubrl  cDNA.  To  isolate  mouse  Ubrl,  a  library  of  mouse  genomic  DNA  fragments  in  a  BAC  vector  (see  Materials  and  Methods)  was  screened  with 
a  fragment  of  the  mouse  Ubrl  cDNA  (nucleotides  105-1,333)  as  a  probe,  yielding  seven  BAC  clones,  of  which  BAC3  and  BAC4  contained  the  entire 
Ubrl  gene.  The  exon/intron  organization  of  the  first  31  kb  (*«1  /4)  of  the  mouse  Ubrl  gene  was  determined  by  using  exon-specific  PCR  primers  to  produce 
«='40  genomic  DNA  fragments  of  the  BAC3  insert  that  ranged  in  size  from  1.3  to  18  kb.  Regions  encompassing  the  exon/intron  junctions  then  were 
sequenced  by  using  intron-specific  primers.  Fragments  of  the  human  genomic  UBRI  DNA  were  isolated  by  using  primers  derived  from  the  1.0-kb  fragment 
of  the  human  UBRI  cDNA  the  Expand  High  Fidelity  PCR  System  (Roche  Molecular  Biochemicals,  Indianapolis,  IN),  and  genomic  DNA  from  human 
293  cells.  The  resulting  four  fragments  were  subcloned  into  pCR2.1  (Invitrogen),  yielding  the  plasmids  HRS,  HR6-4,  HR2-25,  and  HR7-2,  whose  partially 
overlapping  inserts  encompassed  *«21  kb  of  the  human  UBRI  gene.  Partial  sequencing  of  the  mouse  and  human  genomic  Ubrl  fragments  (*«20  kb  of 
sequenced  DNA)  included  all  of  the  exon/intron  junctions  in  these  regions  of  Ubrl. 
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Fig.  3.  Comparison  of  the  deduced  amino  acid  sequence  of  mouse  UBRlp  (Mm-UBRl)  with  those  of  C.  elegans  UBRlp  (Ce-UBRl),  S. 
cerevisiae  Ubrlp  (Sc-UBRl),  and  K.  lactis  Ubrlp  (Kl-UBRl).  White-on-black  and  gray  shadings  highlight,  respectively,  identical  and  similar  residues. 
The  residues  of  UBR  proteins  that  are  identical  to  those  of  S.  cerevisiae  Ubrlp  are  denoted  by  double  dots,  at  positions  where  the  identity  involves 
just  one  non-cerevisiae  protein.  Also  indicated  are  the  regions  of  significant  similarity  among  the  four  proteins.  K.  lactis  UBRl  was  cloned  through 
its  crosshybridization  to  5.  cerevisiae  UBRl  (P.  Waller  and  A.V.,  unpublished  data). 

(CLONTECH),  and  either  mouse  or  human  Ubrl  cDNA  3,150-3,355)  was  used  as  a  template  for  synthesizing  anti¬ 
fragments  labeled  with  were  used  (20).  Southern  hybrid-  sense-  or  sense-strand  RNA  probes  labeled  with  digoxigenin 
izations  were  carried  out  by  using  standard  techniques  (20).  (23). 

Mouse  embryos  were  staged,  fixed,  and  processed  for  in  situ  Chromosome  Mapping  of  the  Mouse  and  Human  Ubrl,  The 

hybridization  as  described  (23).  For  sectioning,  the  stained  mapping  of  mouse  Ubrl  was  carried  out  by  using  the  interspecific 

embryos  were  embedded  in  OCT  medium  (Sakura  Finetek,  backcross  analysis  (24),  essentially  as  described  (18).  Human 

Torrance,  CA).  A  1.2-kb  Ubrl  cDNA  fragment  (nucleotides  UBRJ  was  mapped  by  using  fluorescence  in  situ  hybridization 
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Fig.  4.  Two  Cys/His  domains  of  the 
UBR  protein  family.  Comparison  of  the 
putative  zinc  finger  (region  I)  and 
RING-H2  finger  (region  IV)  with  the 
corresponding  sequences  from  the  other 
species  in  Fig.  3,  and  also  with  C.  albicans 
Ubrlp  (Ca-UBRl),  C  elegans  UBR2p 
(Ce-UBR2),  and  S.  cerevisiac  Ubr2p  (Sc- 
UBR2).  Numbers  indicate  the  lengths  of 
gaps.  The  conserved  Cys  and  His  residues 
are  indicated. 


(FISH)  with  mitotic  chromosomes  from  human  lymphocytes 
(25).  Tlie  probe  was  a  mixture  of  the  HRS,  HR6-4,  HR2-25,  and 
HR7-2  plasmids,  labeled  with  biotin  using  biotinylated  dATP  and 
the  BioNick  labeling  kit  (Life  Technologies,  Grand  Island,  NY), 
and  detected  by  using  fluorescein  isothiocyanate-avidin. 

RESULTS  AND  DISCUSSION 

Isolation  of  the  Mouse  Ubrl  cDNA  Tryptic  peptides  of  the 
purified  rabbit  E3a  (UBRlp)  (16)  were  isolated  and  sequenced 
(sec  Materials  and  Methods),  yielding  14  short  regions  of  E3q:  (Fig. 
I  A).  These  regions  lacked  significant  similarities  to  the  deduced 
sequence  of  S,  cerevisiae  Ubrlp  (13).  We  used  intrapeptide  PCR 
(21)  to  identify  a  unique  (nondegenerate)  sequence  of  the  rabbit 
Uhrl  cDNA.  TTiis  method  allows  amplification  of  a  short  unique 
DN  A  sequence  by  using  two  degenerate  PCR  primers  (derived  by 
reverse  translation)  that  flank  this  sequence  and  correspond  to 
the  outermost  regions  of  a  single  peptide  (Fig.  IB).  Several 
intrapeptide  nucleotide  sequences  were  obtained  this  way  (Fig. 
1).  TTiese  sequences,  together  with  those  of  the  original  degen¬ 
erate  primers,  then  were  used  to  amplify  a  392-bp  fragment  of  the 
rabbit  Ubrl  cDNA,  using  interpeptide  PCR  (Fig.  1).  This  frag¬ 
ment  encoded  peptides  T120  and  T134  at  either  end  and  peptide 
TlOO  in  the  middle  (Fig.  IB).  A  homologous  392-bp  fragment  of 
the  mouse  Ubrl  cDNA  then  was  amplified  by  using  the  same 
method  (Fig.  IB).  The  rabbit  and  mouse  392-bp  Ubrl  cDNA 
fragments  were  88%  and  89%  identical  at  the  nucleotide  and 
amino  acid  sequence  levels,  respectively,  but  lacked  significant 
similarities  to  S.  cerevisiae  UBRl  (data  not  shown). 

The  392-bp  mouse  Ubrl  cDNA  fragment  then  was  used,  in 
conjunction  with  standard  cDNA  library  screening  and  rapid 
amplification  of  cDNA  ends-PCR  (20),  to  isolate  multiple  Ubrl 
cDNA  fragments,  and  to  assemble  them  into  a  5,271-bp  ORF 
encoding  a  1,757-residue  protein  (pi  of  6.0),  whose  size,  200  kDa, 
was  close  to  the  estimated  size  of  the  isolated  rabbit  UBRlp 
(E3a),  ««180  kDa  (16)  (Figs.  2  and  3).  The  inferred  ATG  start 
codon  (Fig.  2),  within  the  sequence  CTTAAGATGGCG,  is 
preceded  by  two  in-frame  stop  codons,  at  positions  -48  and  -93, 
and  is  located  in  a  favorable  Kozak  context  (26),  with  A  and  G 
at  positions  -3  and  -F4,  respectively.  There  are  two  more  ATGs, 
five  and  11  codons  downstream  of  the  inferred  one.  These 
alternative  start  codons  are  in  a  favorable  Kozak  context  as  well. 

Cloning  and  Partial  Characterization  of  the  Mouse  and  Hu¬ 
man  Ubrl  Genes.  A  fragment  of  the  mouse  Ubrl  cDNA  was  used 
to  isolate  a  *»120-kb  mouse  Ubrl  genomic  DNA  clone,  carried  in 
a  BAC  vector  (22).  We  determined  the  exon/intron  organization 
and  restriction  map  of  the  «^31-kb  region  of  Ubrl  that  corre¬ 
sponded  to  the  1,340-bp  5 '-region  of  the  mouse  Ubrl  cDNA 
(nucleotides  105-1,333)  (Fig.  2).  The  lengths  of  the  12  exons  in 
this  region  of  mouse  Ubrl  range  from  63  to  257  bp  (Fig.  2). 

The  nucleotide  and  deduced  amino  acid  sequences  of  the 
1.0-kb  human  UBRl  cDNA  fragment  (see  Materials  and  Meth¬ 
ods),  located  approximately  in  the  middle  of  UBRl  cDNA 
(nucleotides  2,218-3,227  of  the  mouse  Ubrl  cDNA)  (Fig.  2), 
were,  respectively,  91%  and  94%  identical  to  the  corresponding 
mouse  Ubrl  cDNA  and  UBRlp  sequences.  Overlapping  genomic 


DNA  fragments  of  human  UBRl  that,  together,  encompassed  a 
*^21 -kb  region  of  the  human  UBRl  gene  and  corresponded  to  the 
1.0-kb  fragment  of  the  human  UBRl  cDNA  (Fig.  2),  were  isolated 
from  human  DNA  by  using  cDNA-derived  primers  and  PCR. 
Partial  sequencing  showed  that  this  ^2\-kh  region  of  human 
UBRl  contained  1 1  exons  whose  length  ranged  from  49  to  155  bp, 
a  distribution  of  exon  lengths  similar  to  that  in  a  different  region 
of  mouse  Ubrl  (Fig.  2).  All  of  the  sequenced  exon/intron 
junctions  (*«23  exons),  which  encompassed  a  *«52-kb  region  of 
the  mouse  and  human  Ubrl,  contained  the  consensus  GT  and  AG 
dinucleotides  characteristic  of  the  mammalian  nuclear  pre- 
mRNA  splice  sites  (data  not  shown)  (20).  Extrapolating  from 
these  data  on  the  mouse  and  human  Ubrl  genes  and  the  corre¬ 
sponding  regions  of  their  cDNAs  (Fig.  2),  a  mammalian  Ubrl 
gene  is  expected  to  be  *=^120  kb  long  and  to  contain  «=^50  exons. 

The  Mouse  UBR  Ip  Protein  and  its  Homologs.  The  low  overall 
sequence  similarity  of  mouse  UBRlp  (E3a)  to  Ubrlp  of  either  5. 
cerevisiae  [22%  identity  (id.),  48%  similarity  (si.)]  or  another 
budding  yeast,  Kluweromyces  lactis  (21%  id.,  48%  si.),  belied  the 
presence  of  five  regions,  denoted  I-V,  which  were  significantly 
similar  between  the  mouse  and  yeast  versions  of  UBR  Ip  (Figs.  3 
and  4).  By  contrast,  the  Ma/tZ-encoded  asparagine-specific  N- 
terminal  amidase,  the  most  upstream  component  of  the  mouse 
N-end  rule  pathway,  lacks  sequence  similarities  to  its  S.  cerevisiae 
counterpart  Ntalp  (12,  18).  Database  searches  identified  other 
likely  homologs  of  mouse  UBR  Ip,  in  particular  the  1,927-residue 
protein  of  the  nematode  Caenorhabditis  elegans  (GenBank  ac¬ 
cession  no.  U88308)  (32%  id.,  53%  si.;  termed  Ce-Ubrl);  the 
1,872-residue  S.  cerevisiae  protein  (GenBank  accession  no. 
Z73196)  (21%  id.,  47%  si.;  termed  Sc-UBR2;  ref.  4);  the  2,168- 
residue  C.  elegans  protein  (GenBank  accession  no.  U40029)  (21% 
id.,  45%  si.;  termed  Ce-Ubr2);  and  the  794-residue  CER3p 
protein  of  the  phnt  Arabidopsis  thaliana  (GenBank  accession  no. 
X95962)  (26%  id.,  49%  si.).  CER3p  is  involved  in  wax  biosyn¬ 
thesis  in  A.  thaliana  (27).  In  addition,  a  147-residue  sequence  of 
the  yeast  Candida  albicans  (http://alces.med.umn.edu/bin/ 
genelist?LUBRl)  was  similar  to  the  N-terminal  region  of  mouse 
UBRlp  (Fig.  4). 

The  presence  of  high-similarity  regions  I-V  among  these 
deduced  sequences  (Figs.  3  and  4)  suggested  the  existence  of  a 
distinct  protein  family,  termed  UBR.  The  66-residue  region  I, 
near  the  N  terminus  of  UBRlp,  is  a  particularly  clear  UBR 
family-identifying  region  (e.g.,  61%  id.,  75%  si.  between  mouse 
and  C.  elegans  UBRlp)  (Figs.  3  and  4). 

Recent  genetic  analyses  of  5.  cerevisiae  Ubrlp  (N-recognin) 
have  shown  that  the  regions  I-III  contain  residues  essential  for 
the  recognition  of  N-end  rule  substrates  by  Ubrlp.  In  particular, 
Cys-145,  Val-146,  Gly-173,  and  Asp-176  of  region  I  were  identi¬ 
fied  as  essential  residues  of  the  type  1  binding  site  of  S.  cerevisiae 
Ubrlp  (A.  Webster,  M.  Ghislain,  and  A.V.,  unpublished  data). 
All  four  of  these  residues  were  conserved  between  the  yeast, 
mouse,  and  C.  elegans  UBRlp  (Figs.  3  and  4).  Region  I  is  present 
in  all  of  the  known  UBR  family  members  except  CER3p  of  A. 
thaliana,  which  contains  only  regions  IV  and  V  (Fig.  4).  Region 
I  encompasses  a  Cys/His-rich  domain,  Cys-Xt2-Cys-X2-Cys-X5- 


7902  Biochemistry:  Kwon  et  ai 


Proc.  Natl  Acad.  Sci.  USA  95  (1998) 


c 


Fig,  5.  Northern  and  in  situ  hybridizations  with  mouse  and  human  Ubrl.  (A )  Membranes  containing  electrophoretically  fractionated  poly(A)'^ 
mRNA  from  different  mouse  (a-c)  or  human  (d  and  e)  tissues  were  hybridized  with  either  a  2-kb  5'-proximal  (nucleotides  116-2,124)  mouse  UbrJ 
cDNA  fragment  (a),  its  0.64-kb  3'-proximal  (nucleotides  4,749-5,388)  fragment  (b),  a  1-kb  human  UBPI  cDNA  fragment  (d),  or  the  human  p-actin 
cDNA  fragment  (c  and  e).  The  upper  arrows  in  a  and  d  indicate  the  *“8-kb  UbrJ  transcript.  The  lower  arrow  in  a  indicates  the  ^6-kb  testis-specific 
Ubrl  transcript.  In  the  RNA  sample  from  mouse  spleen,  the  Ubrl  transcript  (but  not  the  actin  transcript)  may  have  been  degraded  (fl-c).  (B) 
Expression  of  Ubrl  in  el 0.5  and  el  1.5  mouse  embryos.  Whole-mount  in  situ  hybridization  was  carried  out  with  either  antisense  (AS)  or  sense  (S, 
negative  control)  Ubrl  cDNA  probes  (see  Materials  and  Methods).  The  regions  of  high  Ubrl  expression  are  indicated  by  arrows  (t,  tail;  fl,  forelimb 
buds;  hi,  hindlimb  buds).  The  branchial  arches,  where  Ubrl  is  also  highly  expressed  in  elO.5  embryos  (data  not  shown),  are  not  visible  in  this  elO.5 
embryo.  (C)  Expression  of  Ubrl  in  the  surface  ectoderm  of  limb  buds.  Shown  is  a  transverse  section  of  a  forelimb  bud  of  an  el0,5  embryo  (se, 
surface  ectoderm).  (D)  FISH  analysis  of  human  UBRL  (Upper)  An  example  of  the  -specific  FISH  signal  (arrow).  (Lower)  The  same  mitotic 
spread  stained  with  4'-6-diamino-2-phenylindole  (DAPI)  to  visualize  the  chromosomes  (see  also  Fig,  6). 


Q's-X2-Cys-X2-C^s-X5-His-X2-His-X(i2-14)-Cys-Xi-Cys-Xii-C^s 
(Figs.  3  and  4),  which  is  distinct  from  the  known  consensus 
sequences  of  zinc  fingers  and  other  Cys/His-motifs.  Residues 
Asp-318,  His-321,  and  Glu-560  of  S.  c^revisiae  Ubrlp,  which  have 
been  identified  as  essential  for  the  type  2  binding  site  of  this 
N-recognin  (A.  Webster,  M.  Ghislain,  and  A.V.,  unpublished 
data),  were  found  to  be  retained  in  region  II  (Asp-318  and 
His-321)  and  region  III  (Glu-560)  of  the  mouse  and  C.  elegans 
UBRlp  (Fig.  3). 

Region  IV  contains  another  Cys/His-rich  domain  of  UBRlp, 
Cys-X2’Cys4oop  7-Cys-Xi-His-X2-His-X2-C^s-/oqp  2-Cys-X2-Cys 
(Figs.  3  and  4),  which  is  present  in  all  of  the  UBR  family  members, 
and  fits  the  consensus  sequence  of  the  RING-H2  finger,  a 
subfamily  of  the  previously  defined  RING  motif  (28).  At  least 
some  of  the  RING-H2  sequences  are  sites  of  specific  protein- 
protein  interactions  (28).  Apcllp,  a  subunit  of  the  Ub-protein 
ligase  complex  called  the  cyclosome  (2)  or  the  anaphase  promot¬ 
ing  complex,  also  contains  a  RING-H2  finger  (29). 

Another  area  of  similarity  (24-50%  id.,  46-70%  si.)  among  the 
UBR  family  members  is  region  V  (Fig.  3  and  data  not  shown). 
This  region,  115  residues  long  in  mouse  UBRlp,  near  the 
protein’s  C  terminus,  is  particularly  similar  between  mouse  and  C. 
elegans  UBRlp  (50%  id.,  70%  si.)  (Fig.  3).  Region  V  is  located 
4-14  residues  from  the  UBR  proteins’  C  termini,  the  exceptions 
being  the  S.  cerevisiae  and  K  lactis  Ubrlp,  which  bear,  respec¬ 
tively,  132-  and  159-residue  tails  of  unknown  function  that  are  rich 
in  the  acidic  Asp/Glu  residues  (36%  and  33%)  (Fig.  3).  No 
significant  similarities  could  be  detected  between  mammalian 
UBRlp  and  other  E3s  (recognins)  of  the  metazoan  Ub  ^stem, 


including  E6AP  (30)  and  subunits  of  the  cyclosome/anaphase 
promoting  complex,  except  for  the  presence  of  a  RING-H2  finger 
domain  in  the  latter  (29).  [Different  E3  proteins  of  the  Ub  system 
recognize  different  degrons  in  protein  substrates,  thereby  defin¬ 
ing  distinct  Ub-dependent  proteolytic  pathways  (1,  4).] 

Expression  of  Mouse  and  Human  Ubrl.  The  5'-  and  3'- 
proximal  mouse  cDNA  probes  yielded  similar  results,  detecting  a 
single  «^8-kb  transcript  in  several  tissues  (Fig.  5Aa  and  Ab).  In  the 
testis,  however,  the  *^8-kb  species  of  Ubrl  mRNA  was  a  minor 
one,  the  major  species  being  ^6  kb  (Fig.  5Aa).  The  levels  of  either 
mouse  or  human  Ubrl  mRNA  were  highest  in  skeletal  muscle  and 
heart  (Fig.  5L4).  The  expression  of  mRNA  encoding  E2i4k,  one  of 
the  mouse  Ub-conjugating  (E2)  enzymes  and  a  likely  component 
of  the  mouse  N-end  rule  pathway  (6),  was  also  highest  in  skeletal 
muscle  and  heart  (18). 

The  distinct  Ubrl  mRNA  pattern  in  the  testis  (Fig.  5A)  was 
reminiscent  of  the  analogous  expression  pattern  of  Ntanl 
mRNA,  which  encodes  the  Asn-specific  N-terminal  amidase, 
another  component  of  the  mammalian  N-end  rule  pathway. 
Specifically,  the  size  of  the  major  species  of  Ntanl  mRNA  was 
^lA  kb  in  all  of  the  examined  mouse  tissues  except  testis, 
where  the  major  species  was  ‘=«1.1  kb  (18).  The  *«l.l-kb  Ntanl 
transcript  recently  was  found  to  hybridize  only  to  the  3 ’-half 
(exons  6-10  but  not  exons  1-5)  of  the  Ntanl  ORF  (Y.T.K.  and 
A.V.,  unpublished  data).  The  functional  significance  of  the 
testis-specific  Ubrl  and  Ntanl  expression  patterns  remains  to 
be  understood. 

We  used  whole-mount  in  hybridization  to  examine  the 
expression  of  Ubrl  during  embryogenesis.  In  e9.5  (9.5  days  old) 
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Fig.  6.  Chromosomal  locations  of  the  mouse  and  human  Vhrl 
genes.  (A )  Mouse  Ubrl  was  mapped  to  the  middle  of  mouse  chromo¬ 
some  2  by  using  interspecific  (Af.  musculus-M.  spretus)  backcross 
analysis  (18,  24).  Shown  are  the  segregation  patterns  of  mouse  Ubrl 
and  the  flanking  genes  in  66  backcross  animals  that  were  typed  for  all 
loci.  For  individual  pairs  of  loci,  more  than  66  animals  were  typed. 
Each  column  represents  the  chromosome  identified  in  the  backcross 
progeny  that  was  inherited  from  the  \M.  musculus  C57BL/6J  X  M. 
.spretus]  Fi  parent.  Filled  and  empty  squares  represent,  respectively, 
C57BL/6J  and  Af.  spretus  alleles.  The  numbers  of  offspring  that 
inherited  each  type  of  chromosome  2  are  listed  below  the  columns.  (B) 
A  partial  mouse  chromosome  2  linkage  map  (MMU2),  showing  Ubrl 
in  relation  to  the  linked  genes  Thbsl,  Ebp4.2,  and  B2m,  and  also,  on 
the  left,  the  corresponding  recombination  distances  between  the  loci, 
in  centimorgans,  and  the  map  locations,  in  parentheses.  (C)  A  partial 
human  chromosome  15  linkage  map  (HSA15).  Each  dot  on  the  right, 
in  the  15ql5-q21.1  region,  corresponds  to  the  actually  observed 
-specific  double-dot  FISH  signal  detected  on  human  chromo¬ 
some  15  (see  also  Fig.  5Z>). 


mouse  embryos,  the  expression  of  Ubrl  was  highest  in  the 
branchial  arches  and  in  the  buds  of  forelimbs  and  the  tail  (data 
not  shown).  In  elO.5  embryos,  the  expression  of  Ubrl  became 
high  in  the  hindlimb  buds  as  well  (Fig.  5B).  This  pattern  was 
maintained  in  the  limb  buds  of  el  1.5  embryos  (Fig.  5B).  High 
expression  of  Ubrl  in  the  limb  buds  was  confined  predominantly 
to  the  surface  ectoderm  (Fig.  5C).  This  pattern  of  Ubrl  expression 
in  embryos  (Fig.  5  B  and  C)  is  similar,  if  not  identical,  to  that  of 
Ntanl,  which  encodes  asparagine-specific  N-terminal  amidase 
(18)  (Y.T.K.  and  A.V.,  unpublished  data),  consistent  with 
UBRlp  and  NTANlp  being  components  of  the  same  pathway. 

The  enhanced  expression  of  Ubrl  in  the  embryonic  limb  buds 
(Fig.  5  B  and  C)  is  interesting  in  view  of  the  conjecture  that  the 
N-end  rule  pathway  might  be  required  for  limb  regeneration  in 
amphibians  (31).  The  injection  of  dipeptides  bearing  destabilizing 
N-terminal  residues  into  the  stumps  of  amputated  forelimbs  of 
the  newt  was  observed  to  delay  limb  regeneration,  whereas  the 
injection  of  dipeptides  bearing  stabilizing  N-terminal  residues  had 
no  effect  (31).  Rigorous  tests  of  this  and  other  suggested  functions 
of  the  metazoan  N-end  rule  pathway  (6)  will  require  mouse 
strains  that  lack  Ubrl, 

Chromosome  Mapping  of  Mouse  and  Human  Ubrl.  The 
chromosomal  location  of  mouse  Ubrl  was  determined  by  inter¬ 
specific  backcross  analysis,  using  DNA  derived  from  matings  of 
[(C57BL/6J  X  Mus  spretus)F I  X  C57BL/6J]  mice  (Fig.  6  A  and 
B)  (18,  24).  Mouse  Ubrl  is  located  in  the  central  region  of 
chromosome  2  and  is  linked  to  the  Thbsl,Epb4.2,  and  B2m  genes, 
the  most  likely  gene  order  being  ctnXxomtrQ-Thbsl-Ubrl-Epb4,2- 
B2m  (Fig.  6B  and  data  not  shown). 

The  chromosomal  location  of  human  UBRl  was  determined 
by  using  FISH  (25),  with  human  UBRl  genomic  DNA  frag¬ 
ments  as  probes  (Figs.  5D  and  6C).  This  mapping  placed  UBRl 
at  the  15ql5-15q21.1  region  of  the  human  chromosome  15,  an 
area  syntenic  with  the  independently  mapped  position  of 
mouse  Ubrl  (Fig.  6).  Ubrl  is  located  in  the  regions  of  human 
chromosome  15  and  mouse  chromosome  2  that  appear  to  be 
devoid  of  the  previously  mapped  but  uncloned  mutations. 


Mutations  in  the  human  gene  CANP3,  which  encodes  a  subunit 
of  calpain  and  is  located  very  close,  if  not  adjacent,  to  UBRl, 
have  been  shown  to  cause  a  myopathy  called  the  limb-girdle 
muscular  distrophy  (32). 

Concluding  Remarks.  Isolation  of  the  mouse  and  human  Ubrl 
cDNAs  and  genes  (Figs,  2-6)  should  enable  functional  under¬ 
standing  of  the  mammalian  N-end  rule  pathway,  in  part  through 
the  construction  and  analysis  of  mouse  strains  that  lack  Ubrl. 
Recent  searches  in  GenBank  identified  several  mouse  and  human 
sequences  in  expressed  sequence  tag  databases  that  exhibited 
significant  similarity  to  the  C-terminal  region  of  mouse  UBRlp. 
The  cloning  and  characterization  of  the  corresponding  cDNAs 
have  shown  that  there  exist  at  least  two  distinct  mouse  (and 
human)  genes,  termed  Ubr2  and  Ubr3,  which  encode  proteins  that 
are  significantly  similar  to  mouse  UBRlp  (Y.T.K.  and  A.V., 
unpublished  data).  Molecular  and  functional  analyses  of  these 
Ubrl  homologs  are  under  way. 
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Ubiquitin-dependent  degradation  of  intracellular 
proteins  underlies  a  multitude  of  biological  processes, 
mcluding  the  cell  cycle,  cell  differentiation,  and  re¬ 
sponses  to  stress.  One  ubiquitin-dependent  proteo- 
l3rtic  system  is  the  N-end  rule  pathway,  whose  targets 
include  proteins  that  bear  destabilizing  N-terminal 
residues.  This  pathway,  which  has  been  characterized 
only  in  somatic  cells,  is  shown  here  to  be  present  also 
in  germ  line  cells  such  as  the  eggs  of  the  amphibian 
Xenopus  laevis.  We  demonstrate  that  the  set  of  desta¬ 
bilizing  residues  in  the  N-end  rule  pathway  of  Xenopus 
eggs  is  similar,  if  not  identical,  to  that  of  somatic  cells 
such  as  mammalian  reticuloc3rtes  and  fibroblasts.  It  is 
also  shown  that  the  degradation  of  engineered  N-end 
rule  substrates  in  egg  extracts  can  be  strongly  and 
selectively  inhibited  by  dipeptides  bearing  destabiliz¬ 
ing  N-terxninal  residues.  This  result  allowed  us  to  ask 
whether  selective  inhibition  of  the  N-end  rule  path¬ 
way  in  egg  extracts  infiuences  the  apoptosis-like 
changes  that  are  observed  in  these  extracts.  A  dipep¬ 
tide  bearing  a  bulky  hydrophobic  (type  2)  destabiliz¬ 
ing  N-terminal  residue  was  found  to  delay  the  apopto- 
tic  changes  in  egg  extracts,  whereas  dipeptides  bear¬ 
ing  basic  (type  1)  destabilizing  N-terminal  residues 
had  no  effect.  High  activity  of  the  N-end  rule  pathway 
in  egg  extracts  provides  an  alternative  to  reticulocyte 
extracts  for  the  in  vitro  analyses  of  this  pathway. 

O  1998  Aeademic  Press 

Key  Words:  ubiquitin;  proteolysis;  N-end  rule;  apop¬ 
tosis;  germ  line;  Xenopus. 


A  number  of  regulatory  circuits,  mcluding  those  that 
control  the  cell  cycle,  cell  differentiation,  and  responses 
to  stress,  involve  metabolically  unstable  proteins  (1- 
4).  A  short  in  vivo  half-life  of  a  regulator  provides  a  way 
to  generate  its  spatial  gradients  and  allows  for  rapid 


adjustments  of  its  concentration,  or  subunit  composi¬ 
tion,  through  changes  in  the  rate  of  its  synthesis  or 
degradation.  Damaged  or  otherwise  abnormal  proteins 
tend  to  be  short-lived  as  well. 

Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals,  or  degrons  (5).  The  es¬ 
sential  component  of  one  degradation  signal,  called  the 
N-degron,  is  a  destabilizing  N-terminal  residue  of  a 
protein  (6,  7).  The  set  of  amino  acid  residues  which  are 
destabilizing  in  a  given  cell  yields  a  rule,  called  the 
N-end  rule,  which  relates  the  in  vivo  half-life  of  a 
protein  to  the  identity  of  its  N-terminal  residue.  Simi¬ 
lar  but  distinct  versions  of  the  N-end  rule  pathway  are 
present  in  all  organisms  examined,  from  mammals  to 
fungi  and  bacteria  (7). 

In  eukaryotes,  an  N-degron  comprises  at  least  two 
determinants:  a  destabilizing  N-terminal  residue  and  an 
internal  lysine  or  lysines  (8-10).  The  Lys  residue  is  the 
site  of  formation  of  a  multiubiquitin  chain  (11).  The  N- 
end  rule  pathway  is  thus  a  part  of  the  ubiquitin  (Ub)^ 
S3rstem.  Ub  is  a  76-residue  eiakaiyotic  protein  whose  co¬ 
valent  conjugation  to  other  proteins  plays  a  role  in  a 
multitude  of  biological  processes  (1,  12-15).  In  most  of 
them,  Ub  acts  through  routes  that  involve  the  degrada¬ 
tion  of  Ub-protein  conjugates  by  the  26S  proteasome,  an 
ATP-dependent  multisubunit  protease  (16-18). 

It  was  found  that  linear  Ub  fusions  are  rapidly 
cleaved  by  Ub-specific  proteases  (UBPs)  at  the  Ub- 
protein  junction,  making  possible  the  production  of 
otherwise  identical  proteins  bearing  different  N-termi¬ 
nal  residues.  This  technical  advance  led  to  the  discov¬ 
ery  of  the  N-end  rule  pathway  (6,  7).  The  N-end  rule  is 
organized  hierarchically.  In  the  yeast  Saccharomyces 
cerevisiae,  Asn  and  Gin  are  tertiary  destabilizing  N- 
terminal  residues  in  that  they  function  through  their 
conversion,  by  en23nmatic  deamidation,  into  the  second- 


^  Abbreviations  used:  Ub,  ubiquitin;  DHFR,  mouse  dihydrofolate 
^  To  whom  correspondence  should  be  addressed  at  Division  of  reductase;  R*transferase,  Arg-tRNA-protetin  transferase;  CSF,  cyto- 
Biology,  147-75,  Caltech,  1200  East  California  Boulevard,  Pasadena,  static  factor;  TCA,  trichloroacetic  acid;  DTT,  dithiothreitol;  AMC, 
CA  91125.  Fax:  (626)  440-9821.  E-mail:  avarsh@cco.caltech.edu.  7-amino-methylcoumarin;  UBP,  Ub-spedfic  processing  protease. 
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ary  destabilizing  N-terminal  residues  Asp  and  Glu. 
The  destabilizing  activity  of  N-terminal  Asp  and  Glu 
requires  their  conjugation,  by  Arg-tRNA-protein  trans¬ 
ferase  (R-transferase),  to  Arg,  one  of  the  primary  de¬ 
stabilizing  residues  (19-21).  The  primary  destabilizing 
N-terminal  residues  are  boimd  directly  by  N-recognin 
(also  called  E3),  the  recognition  component  of  the  N- 
end  rule  pathway.  In  S.  cerevisiae,  N-recognin  is  a 
225-kDa  protein,  encoded  by  UBRl,  that  targets  poten¬ 
tial  N-end  rule  substrates  through  the  binding  to  their 
primary  destabilizing  N-terminal  residues  —  Phe,  Leu, 
Trp,  Tyr,  Re,  Arg,  Lys,  or  His  (7,  22).  N-recognin  has  at 
least  two  substrate-binding  sites.  The  type  1  site  is 
specific  for  the  basic  N-terminal  residues  Arg,  Lys,  and 
His.  The  type  2  site  is  specific  for  the  bulky  hydropho¬ 
bic  N-terminal  residues  Phe,  Leu,  Trp,  Tyr,  and  lie  (7). 

The  currently  known  physiological  substrates  of  the 
N-end  rule  pathway  are  the  RNA  polymerase  of  Sindbis 
virus  (23),  the  GPAi -encoded  Ga  subunit  of  the  S.  cerevi¬ 
siae  heterotrimeric  G  protein  (24),  and  Cup9p,  a  short¬ 
lived  transcriptional  repressor  in  S.  cerevisiae  that  con¬ 
trols  the  expression  of  PTR2,  which  encodes  a  peptide 
transporter  (25).  The  Ubrlp-Cup9p-Ptr2p  circuit,  which 
controls  the  import  of  peptides  in  yeast,  is  the  first  clear 
mgtnnrp  of  a  physiological  function  of  the  N-end  rule 
pathway  (25).  Among  the  cells  of  multicellular  organ¬ 
isms,  this  proteolytic  system  was  characterized  in  rabbit 
reticulocytes  (19)  and  L  cells,  a  line  of  fibroblast-like 
mouse  cells  (26).  Recently,  it  was  suggested  tlmt  the 
N-end  rule  pathway  might  play  a  role  in  apoptosis  (pro¬ 
grammed  cell  death)  (7).  A  way  to  verify  this  conjecture 
would  be  to  use  a  mutant  t^t  lacks  the  N-end  rule 
pathway.  Such  mutants  were  constructed  and  character¬ 
ized  in  Escherichia  coli  and  S.  cerevisiae  (7)  but  not  yet  in 
multicellular  organisms.  Therefore  we  considered  the  use 
of  N-end  rule  inhibitors. 

Previous  work  has  shown  that  the  addition  of  amino 
acid  derivatives  such  as  dipeptides  that  bear  destabi¬ 
lizing  N-terminal  residues  to  reticulocyte  extract  re¬ 
sults  in  a  strong  and  selective  inhibition  of  the  N-end 
rule  pathway  in  the  extract  (19, 27).  Specifically,  dipep¬ 
tides  bearing  type  1  destabilizing  N-terminal  residues 
inhibited  the  degradation  of  test  N-end  rule  substrates 
bearing  basic  (type  1)  destabilizing  N-terminal  resi¬ 
dues  but  had  no  effect  on  the  degradation  of  substrates 
bearing  type  2  N-terminal  residues.  A  converse  pattern 
was  observed  with  dipeptides  bearing  bulky  hydropho¬ 
bic  (type  2)  destabilizing  N-terminal  residues  (19). 
Dipeptides  added  to  S.  cerevisiae  cultures  have  been 
demonstrated  to  inhibit  the  N-end  rule  pathway  in  vivo 
as  well  (28).  However,  experiments  with  the  N-end  rule 
pathway  of  intact  mammalian  cells  have  shown  dipep¬ 
tides  to  be  ineffective  inhibitors  in  this  setting®. 


^  F.  Levy  and  A.  Varshavsky,  unpublished  data. 


Given  this  latter  constraint,  we  sought  to  explore  the 
physiological  consequences  of  inhibiting  a  metazoan 
N-end  rule  pathway  in  the  inhibitor- accessible  setting 
of  a  cell-free  system.  We  sdso  wished  to  determine 
whether  the  N-end  rule  pathway  is  present  in  germ 
line  cells  such  as  the  eggs  of  the  amphibian  Xenopus 
laevisy  a  major  experimental  organism  in  studies  of 
embryogenesis  and  cell  cycle  control  (4,  29).  Our  find¬ 
ings  are  described  below. 

MATERIALS  AND  METHODS 

Construction  of  plasmids.  The  plasmids  pT7‘UbMe^DHFRhis 
and  pT7-UbRe^DHFRhis  expressed  Ub-Met-e^-DHFR-Hisg  and  Ub- 
Arg-e^-DHFR-Hisg  fusions  (denoted,  respectively,  as  Ub-Met-DHFR 
and  Ub-Arg-DHFR)  in  E.  coli  from  the  T7  polymerase  promoter  (see 
Results  for  the  definitions  of  e*^  and  other  terms).  These  plasmids 
were  constructed  from  pEJJl-M  and  pEJJl-R  (30).  The  Kpnl-Hind- 
III  fragment  of  pEJJl-M  and  pEJJl-R  was  replaced  with  a  synthetic 
oligonucleotide  dupl^  (5'-CCATCACCATCACCATCACTAAA“3  and 
5'-AGCTTTTAGTGATGGTGATGGTGATCK}GTAC-3')  that  encoded 
the  Hisg  tag  and  bore  the  Kpnl  and  Hi/idlll  overhangs.  The  other 
pT7-UbXe*^DHFRhis  plasmids,  which  encoded  otherwise  identi¬ 
cal  Ub-X-DHFR  fusions  bearing  different  X  residues  (Met,  Arg, 
Leu,  Phe,  Cys,  Asp,  or  Asn)  (Fig.  1),  were  constructed  from  pT7- 
UbRe^DHFRhis  by  site-directed  mutagenesis,  using  PCR  (31). 

Overexpression,  labeling,  and  purification  of  Ub-X^e^-DHFR-Hisg 
proteins.  A  pT7-UbXe^DHFRhis  plasmid  was  introduced  into  E. 
coli  BL21  (DE3)  (31).  Protein  expression  was  induced  in  E,  coli  by 
adding  isopropyl-l-thio-P-D-galactopyranoside  (EPTG),  and  ceils 
were  labeled  with  P^Slmethionine/cysteine  (®®S-Express,  New  En¬ 
gland  Nuclear,  Boston,  MA).  as  described  (30).  The  labeled  cells  were 
collected  by  centrifugation  and  disrupted  by  sonication,  and  a  ^®S- 
labeled  Ub-X-e^-DHFR-Hisg  (Ub-X-DHFR)  test  protein  was  purified 
bv  affinity  chromatography  under  nondenaturing  conditions,  using 
the  Ni-NTA  Spin  Kit  (Qiagen,  Chatsworth,  CA),  and  dialyzed  against 
1  mM  MgCU,  1  mM  dithiothreitol  (DTT),  0.1  M  Tris-HCl  (pH  7.7), 
fiozen  rapidly,  and  stored  at  —SO^C  in  samples  that  were  to  be 
thawed  only  once.  The  specific  radioactivity  of  [®®S]Ub-X-DHFR  pro¬ 
teins  was  5-10  X  10®  cpm/pg. 

Degradation  assays  in  Xenopus  egg  extracts,  C3ftostatic  factor 
(CSF)-arrested  egg  extracts  were  prepared  from  Xenopus  eggs  as  de¬ 
scribed  (32).  In  most  experiments,  cycloheximide  (0.1  mg/ml)  was  added 
to  the  extracts  to  preclude  reincorporation  of  P®S)  methionine  into  newly 
made  proteins.  In  some  experiments  (see  the  legends  to  figures),  egg 
extracts  were  activated  by  the  addition  of  0.4  znM  CaCl2  1  h  before  the 
assay.  P®S]Ub-X-DHFR  test  proteins  were  added  to  the  extract  to  the 
final  concentration  of  ^25  pg/ml.  Dipeptides  were  added  together  with 
bestatin  (Sigma,  St.  Louis,  MO)  to  the  final  concentrations  of  10  mM 
^nd  50  /ig/ml,  respectively.  Bestatin  was  added  to  decrease  the  degra¬ 
dation  of  dipeptides  in  the  extract  (27).  (Control  experiments  (not 
shown)  showed  that  the  addition  of  bestatin  alone  did  not  significantly 
inhibit  the  degradation  of  test  proteins  by  the  N-end  rule  pathway.)  The 
following  dipeptides  and  other  amino  add  derivatives  were  used:  Arg- 
P-Ala,  Ala-Lys,  Lys-Ala,  Trp-Ala  (Sigma),  and  Tyr-His  (Bachem  Sd- 
ence,  King  of  Prussia,  PA).  Stock  samples  of  dipeptides  were  0.5  M 
solutions  in  10  mM  K-Hepes,  pH  7.5. 

To  follow  the  degradation  of  test  proteins,  reaction  mixtures  (0.1 
ml)  prepared  as  described  above  were  incubated  at  23® C.  Samples 
(2.5  fiJ)  were  withdrawn  in  duplicate  for  each  time  point.  One  sample 
was  examined  by  SDS-12%  PAGE  and  autoradiography,  using  Phos- 
phorlmager  (Molecular  D3nnamics,  Sunnyvale,  CA)  (33).  The  other 
sample  was  used  to  determine  the  relative  amount  (%)  of  ^S  soluble 
in  5%  trichloroacetic  add  (TCA)  (30).  This  parameter  was  calculated 
as  follows: 
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FIG.  1.  The  Ub-X-DHFR  test  proteins.  These  fusions,  identical  except  for  the  variable  residue  X  adjacent  to  the  Ub  moiety,  were  expressed 
and  radiolabeled  in  E.  coli,  purified,  and  used  as  described  under  Materials  and  Methods.  X  was  either  Met,  Arg,  Leu,  Phe,  Cys,  Asp,  or  Asn. 
The  E.  coil  Lac  repressor-derived  42-residue  sequence,  denoted  as  e^  (extension  (e)  containing  lysines  (K))  was  described  previously  (8,  30, 
37).  The  alternative  ubiquitylation  sites  Lys-15  and  Lys-17  are  boxed  in  the  sequence  of  e^  (8).  DHFR  represents  DHFR-Hisg,  the  mouse 
DHFR  whose  C-terminus  was  extended  by  8  residues  that  included  the  HiSg  tag. 


-X'  a 

percent  TCA-soluble  =  p  ‘  ^  ^  •  100%, 

where  X  is  the  amount  of  TCA-soluble  “S  (cpm);  Y  is  the  total 
amount  of  (cpm)  in  the  same  sample;  a  is  the  number  of  Met 
residues  in  a  Ub-X-DHFR  test  protein  (a  =  10  for  Ub-Met-DHFR;  a  = 
9  for  the  other  Ub-X-DHFRs).  The  a  -1  term  above  corrects  for  the 
presence  of  one  Met  residue  in  Ub. 

The  data  presented  in  each  of  the  experimental  Figures  were 
produced  using  the  same  extract  on  the  same  day.  Unless  stated 
otherwise,  the  data  were  consistent  between  at  least  two  (usually 
three  or  more)  independent  experiments  carried  out  with  different 
preparations  of  the  extract. 

In  vitro  apoptosis  assays^  Apoptotic  extracts  firom  Xenopus  eggs 
were  prepared  according  to  Newmeyer  et  aL  (34).  The  difference 
between  control  and  apoptotic  egg  extracts  was  in  the  time  intervals 
between  hormone  injectiohs  used  to  produce  the  eggs.  Frogs  were 
injected  with  75  units  of  pregnant  mare  serum  gonadotropin  (Cal- 
biochem,  San  Diego,  CA)  14-30  days  (in  contrast  to  3-10  days  for 
control  extracts)  before  they  were  induced  to  lay  eggs  by  injecting  800 
units  of  human  chorionic  gonadotropin  (Sigma).  Eggs  were  collected, 
and  the  extract  was  prepared  as  described  (32),  The  extracts  were 
used  within  1  h  after  preparation.  For  most  assays,  the  extracts  were 
arrested  in  interphase  by  the  addition  of  0.4  xnM  CaCL  and  0.1 
mg/ml  cycloheximide.  DemembranatedXe/iopus  sperm  nuclei  (32),  at 
—1000  per  pi  of  extract,  were  also  added  at  liiis  time.  Extracts 
treated  as  described  were  distributed  into  0.1-ml  samples  and  incu¬ 
bated  at  23°C.  Apoptotic  changes  in  the  extracts  were  monitored  in 
two  wa3rs:  by  observing  shrinkage  and  disintegration  of  the  added 
sperm  nuclei  using  phase  contrast  and  fluorescent  microscopy  (34) 
and  by  measuring  the  DEVD-specific  protease  activity.  The  latter 
assay  determined  the  total  activity  of  caspases  that  recognize  the 
tetrapeptide  sequence  DEVD  (Asp-Glu-Val-Asp)  (35,  36). 

The  DEVD-specific  protease  activity  was  measured  by  incubating 
5  pi  of  extract  with  a  quenched  fluorescent  substrate  Ac-DEVD-AMC 
(50  pM)  (Bachem  Biosdence)  in  0.1  ml  of  the  extract  buffer  (15  mM 
MgCl2,  1  mM  DTT,  20  mM  EGTA,  80  mM  Na-j3-giycerolphosphate, 
pH  7.3).  The  reaction  was  carried  out  for  10  min  at  23® C,  followed  by 
freezing  of  a  20-pl  sample  in  liquid  Nj.  Samples  were  thawed  by 
diluting  into  1  ml  of  phosphate-buffered  saline,  and  their  fluores¬ 
cence  (excitation  at  380  nm  and  emission  at  460  nm)  was  measured 
in  the  Hitachi  F-4500  spectrofluorimeter  (35,  36).  AMC  was  used  as 
a  fluorescence  standard. 

RESULTS 

The  N-End  Rule  Pathway  in  Xenopus  Egg  Extracts 

Among  the  cells  of  multicellular  oigjanisms,  the  N-end 
rule  pathway  has  been  identified  and  analyzed  in  rabbit 


reticulocytes  (19)  and  L  cells,  a  line  of  transformed,  fibro¬ 
blast-like  mouse  cells  (26).  To  determine  whether  the 
N-end  rule  pathway  is  present  in  germ  line  cells  such  as 
Xenopus  eggs,  we  used  an  approach  similar  to  the  earlier 
one  with  rabbit  reticulocytes  (19,  30). 

A  set  of  N-end  rule  substrates  differing  exclusively 
by  their  N-terminal  residues  was  constructed,  ex¬ 
pressed  in  E.  coli,  metabolically  labeled  with  [^®S]  me¬ 
thionine,  and  purified  by  affinity  chromatography  (see 
Materials  and  Methods).  A  test  protein  of  this  set, 
Ub-X-e^-DHFR-Hise,  denoted  below  as  Ub-X-DHFR, 
contained  the  following  parts:  the  N-terminal  Ub  moi¬ 
ety;  a  variable  residue  X;  e^,  a  42-residue,  E.  coli  Lac 
repressor-derived  sequence  that  contained  the  second 
(lysine)  determinant  of  the  N-degron  (8,  37);  and  the 
Hisg-tagged  mouse  dihydrofolate  reductase  moiety 
(Fig.  1).  Ub  fusions  are  not  cleaved  at  the  Ub-protein 
junction  in  E.  coli,  which  lacks  the  Ub  system  (38).  By 
contrast,  in  eukaryotes  the  Ub  fusions,  including  the 
Ub-X-DHFRs,  are  rapidly  cleaved  by  UBPs  after  the 
last  residue  of  Ub,  making  it  possible  to  produce,  in 
vivo  or  in  vitro,  otherwise  identical  test  proteins  such 
as  X-DHFRs  that  differ  exclusively  by  their  N-terminal 
residues  (6,  7). 

CSF-arrested  Xenopus  egg  extracts  were  prepared 
from  unfertilized  frog  eggs  (32).  Specific  ^^S-labeled 
Ub-X-DHFR  test  proteins  (Fig.  1)  were  added  to  the 
egg  extracts,  and  their  metabolic  fates  were  monitored 
either  by  SDS-PAGE  and  autoradiography  or  by  mea¬ 
suring  the  amount  of  TCA-soluble  released  during 
incubation  at  23®C.  The  identity  of  residue  X  in  the 
tested  X-DHFRs  encompassed  at  least  one  representa¬ 
tive  of  each  class  of  N-terminal  destabilizing  residues: 
Asn  (tertiary  destabilizing);  Asp  and  Cys  (secondary 
destabilizing),  Arg  (type  1  primary  destabilizing),  Phe 
and  Leu  (type  2  primary  destabilizing),  and  Met  (sta¬ 
bilizing)  (see  introduction)  (7). 

As  expected,  a  Ub-X-DHFR  test  protein  was  deubiq- 
uitylated  upon  its  addition  to  the  extract.  This  was 
detected  through  an  increase  in  the  electrophoretic 
mobility  of  the  major  ^®S-labeled  species  and  the  con- 
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FIG.  2.  Deubiquitylation  and  degradation  of  Ub-X-DHFR  test  proteins  in  Xenopus  egg  extract.  (A)  ^®S-labeled,  purified  Ub-Arg-DHFR  was 
added  to  a  CSF-arrested  Xenopus  egg  extract  and  incubated  at  23®C  for  the  indicated  times.  The  samples  were  analyzed  by  SDS-PAGE,  and 
the  labeled  species  were  detected  by  autoradiography  (see  Materials  and  Methods).  (B)  Same  as  A,  but  with  Ub-Phe-DHFR.  (C)  Same  as  A, 
but  with  Ub-Met-DHFR  (only  the  area  around  Met-DHFR  is  shown).  (D)  Same  as  C,  but  with  Ub-Asp-DHFR.  (E)  Same  as  C,  but  with 
Ub-Asn-DHFR,  Open  arrowheads  indicate  the  bands  of  the  initial,  34-kDa  Ub-X-DHFR.  Closed  arrowheads  indicate  the  bands  of  deubiq- 
uitylated,  26-kDa  X-DHFR.  An  asterisk  indicates  the  band  of  firee  [^®S]Ub,  which  migrated  close  to  the  dye  front  in  this  electrophoretic 
system.  The  molecular  masses  (in  kDa)  of  protein  markers  are  indicated  to  the  right  of  B. 


comitant  appearance  of  a  labeled  *^8  kDa  species  (Fig. 
2  and  data  not  shown).  Deubiquitylation  of  Ub-X- 
DHFR  in  the  extract  was  essentially  complete  in  5  min 
or  less  (Fig.  2).  Whereas  the  relative  intensity  of  the 
free  Ub  band  remained  nearly  constant  during  the 
subsequent  4-h  incubation  (Figs.  2A  and  2B,  and  data 
not  shown),  the  relative  intensity  of  the  X-DHFR  band 
was  either  constant  or  decreased,  at  a  var3dng  rate, 
depending  on  the  identity  of  the  N-terminal  residue  X. 
Based  on  their  relative  metabolic  stabilities  in  the  ex¬ 
tract  (Figs.  2  and  3A,  and  data  not  shown),  the  X- 
DHFR  test  proteins  could  be  reproducibly  ranked  from 
the  long-hved  to  the  short-lived  as  follows:  Met  > 
Cys  >  Leu  >  Asp  >  Asn  >  Phe  >  Arg.  We  conclude 
that  Xenopus  egg  extracts  contain  an  N-end  rule  path¬ 
way  whose  rule  book  is  similar,  and  may  be  identical, 
to  those  found  in  rabbit  reticulocytes  (19)  and  mouse  L 
cells  (26). 

The  decay  curves  of  X-DHFRs  that  were  determined 
by  monitoring  the  release  of  TCA-soluble  and  the 
resulting  ranking  of  their  N-terminal  destabilizing  res¬ 
idues  were  reproducible  between  experiments  that  uti¬ 
lized  independent  preparations  of  the  egg  extract  (data 
not  shown).  Specifically,  it  took  min  to  destroy 
50%  of  Arg-DHFR  to  acid-soluble  fragments.  By  30 
min,  the  degradation  of  Arg-DHFR  was  ^^90%  com¬ 
plete  (Figs.  2A  and  3A). 

Approximately  20%  of  the  added  Met-DHFR,  which 
bore  a  stabilizing  N-terminal  residue  (7),  was  degraded 
during  the  first  10  min  of  incubation,  and  another  5  to 


15%  was  degraded  over  the  next  30  min.  However,  the 
rest  of  the  Met-DHFR  remained  stable  in  the  extract 
(Figs.  2C  and  3A).  A  likely  explanation  of  this  result  is 
that  our  preparation  contained  two  t3Tpes  of  Met-DHFR 
molecules:  --70%  were  undamaged  and  long-lived  in 
the  extract,  as  observed  with  Met-DHFR  in  vivo,  in 
both  S,  cerevisiae  and  mouse  L  cells  (Ref.  8;  F.  Levy  and 
A.  V.,  unpubl.  data),  whereas  the  remainder  comprised 
misfolded  or  otherwise  damaged  Met-DHFR  molecules 
that  were  degraded  by  a  proteolytic  system  distinct 
from  the  N-end  rule  pathway.  By  inference,  the  same 
background  of  degradation  of  a  subpopulation  of  mol¬ 
ecules  would  be  expected  for  other  X-DHFRs.  It  is 
unclear  whether  a  subpopulation  of  misfolded  or  oth¬ 
erwise  daimaged  X-DHFRs  resulted  from  their  overex¬ 
pression  in  E.  coli,  or  whether  the  damage  occurred 
largely  during  purification  of  X-DHFRs.  Independent 
evidence  for  this  explanation  of  the  background  degra¬ 
dation  of  Met-DHFR  is  presented  below. 

Previous  work  has  shown  that  N-terminal  Cys  is  a 
stabilizing  residue  in  S.  cerevisiae  (8),  but  is  a  second¬ 
ary  destabilizing  residue  in  rabbit  reticulocyte  extracts 
and  mouse  L  cells  (19,  26).  In  Xenopus  egg  extracts, 
Cys  was  foimd  to  be  a  weakly  destabihzing  residue. 
Specifically,  among  the  tested  X-DHFRs,  Cys-DHFR 
was  the  longest-lived  test  protein  save  for  Met-DHFR, 
which  bore  a  stabilizing  N-terminal  residue  (Fig.  3A). 
Substrates  that  bear  secondary  or  tertiary  destabiliz¬ 
ing  residues  require  the  tRNA-dependent  activity  of 
R-transferase  for  their  degradation  by  the  N-end  rule 
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FIG.  3.  Degradation  of  X-DHFR  test  proteins  in  egg  extracts  under  different  conditions,  as  determined  by  measuring  TCA-soiuble  (A) 
^®S“labeled»  purified  Ub-Arg-DHFR  (G),  Ub-Phe-DHFR  (O),  Ub-Met-DHFR  (A),  Ub-Asp-DHFR  (C),  and  Ub-Asn-DHFR  (ffl)  were  added  to 
samples  of  the  CSF-arrested  Xenopus  egg  extract.  After  incubation  at  23*C  for  the  indicated  times,  the  amounts  of  soluble  in  5%  TCA  were 
determined  for  each  time  point  (see  Materials  and  Methods).  Since  the  free  P®S]Ub,  produced  through  deubiquitylation  of  a  Ub-X-DHFR,  was 
stable  during  the  incubation,  100%  of  TCA-soluble  “S  represented  complete  degradation  of  the  Ub-lacking  P^SIX-DHFR  (see  Materials  and 
Methods  for  further  details).  (B)  Pretreatment  of  egg  extract  with  RNAase  selectively  inhibits  degradation  of  N-end  rule  substrates  bearing 
secondary  destabilizing  N>terminal  residues.  A  CSF-arrested  extract  was  prerreated  with  5  unite/ml  of  RNAase  A-agarose  (Sigma)  for  1  h  at 
23®C,  followed  by  the  removal  of  RNAase-agarose  by  centrifugation  (closed  symbol  curves).  Another  sample  of  the  same  extract  was  incubated 
for  1  h  without  RNAase  (open  symbol  curves).  ^®S*labeled  Ub-C3rs-DHFR  itriangles)y  Ub-Asp-DHFR  icircles)^  or  Ub-Arg-DHFR  (squares)  were 
added  to  both  extracts,  and  the  relative  amounts  of  TCA-soluble  ^*S  released  during  the  incubation  were  determined  as  in  A.  (C)  Different 
states  of  egg  extract  do  not  significantly  alter  the  activity  of  the  N-end  rule  pathway.  ^®S-labeled  Ub-Phe-DHFR  was  added  to  the 
CSF-arrested  egg  extract  (□),  to  the  CaCl2'treated  extract  that  proceeded  toward  mitosis  (A),  and  to  the  CaCU'  and  cycloheximide-treated, 
interphase-arrested  extract  (O).  The  degradation  of  Phe-DHFR  was  monitored  as  described  in  A. 


pathway  (see  introduction).  Previous  work  has  shown 
that  pretreatment  of  reticuloc3rte  extract  with  RNAase 
A  does  not  significantly  affect  the  degradation  of  N-end 
rule  substrates  bearing  primary  destabilizing  residues 
but  abolishes  the  degradation  of  otherwise  identical 
substrates  bearing  secondary  or  tertiary  destabilizing 
N-terminal  residues  (19).  In  agreement  with  these 
findings,  preincubation  of  Xenopiis  egg  extract  with 
RNAase  A  selectively  inhibited  the  degradation  of  Cys- 
DHFR  and  Asp-DHFR,  but  had  no  effect  on  the  degra¬ 
dation  of  Arg-DHFR  (Fig.  3B).  We  conclude  that  both 
Cys  and  Asp  are  secondary  destabilizing  residues  in 
the  N-end  rule  pathway  of  Xenopus  eggs.  Taken  to¬ 
gether  with  the  earlier  findings  about  the  N-end  rule 
pathway  in  reticulocyte  extract  (19),  the  present  find¬ 
ings  are  consistent  with  the  conjecture  that  the  set  of 
secondary  and  tertiary  destabilizing  residues  in  the 
N-end  rule  ofXenopus  eggs  is  identical  to  that  of  rabbit 
reticulocytes:  two  tertiary  destabilizing  residues  (Asn 
and  Gin)  and  three  secondary  ones  (Asp,  Glu,  and  Cys). 

We  also  asked  whether  the  rate  of  degradation  of  an 
N-end  rule  substrate  depends  on  physiologically  rele¬ 
vant  changes  in  the  state  of  an  egg  extract.  U^ertil- 
ized  Xenopus  eggs  are  arrested  at  the  metaphase  of 
meiosis  II  through  the  action  of  c3d;ostatic  &ctor  (39, 
40).  Fertilization  results  in  a  transient  increase  of  the 
Ca^*^  concentration  in  the  eggs  that  inactivates  CSF, 
allowii^  cells  to  enter  interphase  and  proceed  to  mito¬ 
sis.  This  process  can  be  mimicked  in  vitro  by  the  addi¬ 
tion  of  Ca^"^  (32).  Such  extracts,  referred  to  as  cycling 
extracts,  imdergo  changes  characteristic  of  eggs  at  mi¬ 


tosis,  as  observed  through  alterations  in  the  morphol¬ 
ogy  of  Xenopus  sperm  nuclei  added  to  the  extract.  The 
addition  of  cycloheximide  and  CaCl2  to  a  CSF-arrested 
extract  results  not  only  in  the  escape  fiinm  CSF  arrest 
but  also  in  a  subsequent  arrest  at  interphase,  owing  to 
the  inhibition  of  protein  s3mthesis,  specificadly  the  syn¬ 
thesis  of  cyclins  (32).  We  examined  the  degradation  of 
Phe-DHFR,  bearing  a  t3q)e  2  primary  destabilizing  N- 
terminal  residue,  in  the  CSF-arrested,  interphase-ar¬ 
rested,  and  cycling  egg  extracts,  and  found  no  signifi¬ 
cant  differences  in  the  rate  of  degradation  of  this  N-end 
rule  substrate  in  different  extracts  (Fig.  3C). 

Dipeptides  Bearing  Destabilizing  N-Terminal 

Residues  Are  Efficacious  Inhibitors  of  the  N-End 

Rule  Pathway  in  Xenopus  Egg  Extracts 

Previous  work  has  shown  that  the  addition  of  amino 
acid  derivatives  such  as  dipeptides  bearing  destabiliz¬ 
ing  N-terminal  residues  to  reticulocyte  extracts  or  in¬ 
tact  yeast  cells  results  in  a  strong  and  selective  inhibi¬ 
tion  of  the  N-end  rule  pathway  in  these  settings  (19, 
27).  However,  experiments  with  intact  mammalian 
cells  in  culture  have  shown  dipeptides  to  be  at  most 
weak  inhibitors  of  the  N-end  rule  pathway  in  this 
setting®.  The  present  work  stemmed  in  part  from  the 
possibility  of  exploring  the  ph}rsiological  consequences 
of  pertmrbing  a  metazoan  N-end  rule  pathway  in  the 
inhibitor-accessible  setting  of  a  cell-free  system  such  as 
a  Xenopus  egg  extract. 
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FIG.  4.  Inhibition  of  the  N-end  rule  pathway  in  egg  extracts  by 
dipeptides  bearing  destabilizing  N-tenninal  residues.  (A)  “S-la- 
beled,  purified  Ub-Arg-DHFR  was  added  to  CSF-arrested  Xenopus 
egg  extracts  in  the  absence  (□)  or  the  presence  of  the  following 
dipeptides  (10  mM),  together  with  bestatin  (50  /xg/ml):  Arg-/3-Ala  (O), 
Lys-Ala  (O),  Tyr-His  (A),  Trp-Ala  (ffi),  and  Ala-Lys  (X).  After  incu¬ 
bation  at  23'C  for  the  indicated  times,  the  amounts  of  ®®S  soluble  in 
5%  TCA  were  determined  for  each  time  point.  (B)  Same  as  in  A  but 
with  Ub-Phe-DHFR. 


As  shown  in  Figs.  4  and  5,  Lys-Ala  and  espetdally 
Arg-^-Ala,  both  of  which  bear  a  type  1  primary  desta¬ 
bilizing  N-terminal  residue,  strongly  inhibited  the  deg¬ 
radation  of  Arg-DHFR  in  the  egg  extract.  This  inhibi¬ 
tion  was  selective,  in  that  the  same  dipeptides  had  no 
significant  effect  on  the  degradation  of  Phe-DHFR, 
which  bore  a  type  2  primary  destabilizing  N-terminal 
residue.  Conversely,  Tyr-His  and  Trp-Ala,  which  bear 
type  2  rieytflhilizing  N-terminal  residues,  strongly  in¬ 
hibited  the  degradation  of  Phe-DHFR,  but  had  at  most 
a  small  effect  on  the  degradation  of  Arg-DHFR  (Figs.  4 
and  5).  Ala-Lys,  which  bears  a  type  3  destabilizing 
N-terminal  residue  (19),  did  not  inhibit  the  degrada¬ 
tion  of  either  Arg-  or  Phe-DHFRs  (Fig.  4). 

The  dipeptides  had  little  effect  on  the  degradation  of 
their  cognate  N-end  rule  substrates  during  the  first  15 
min  of  the  incubation,  but  subsequent  proteolysis  was 
strongly  inhibited  (Fig.  4).  This  result  is  consistent 
with  the  finding  that  20-30%  of  Met-DHFR,  which 
bears  a  stabilizing  N-terminal  residue,  was  rapidly 
degraded,  whereas  the  rest  of  the  added  Met-DHFR 
was  stable  (Fig.  3A).  As  described  above,  the  latter 
finding  could  be  accoimted  for  if  our  preparation  of 
Met-DHFR  (and,  by  inference,  of  other  X-DHFRs)  com¬ 
prised  two  distinct  populations  of  molecules:  undam¬ 
aged  ones  (~70%)  and  hence,  in  the  case  of  Met-DHFR, 
long-lived  in  the  extract,  and  misfolded  or  otherwise 
damaged  molecules  that  were  recognized  and  degraded 
by  a  proteolsdic  system  distinct  firom  the  N-end  rule 
pathway.  The  fiulure  of  dipeptides  to  inhibit  the  initial 
burst  of  degradation  of  their  cognate  X-DHFRs  (Fig.  4) 
is  in  agreement  with  this  explanation. 


Apoptosis  and  the  N-End  Rule  Pathway 

Several  groups  have  described  the  use  of  Xenopus 
egg  extracts  to  analyze  the  process  of  apoptosis  (34,  36, 
41-43).  After  several  hours  of  incubation  of  the  inter¬ 
phase-arrested,  apoptotic  egg  extracts,  the  activity  of 
DEVD-specific  caspases  (a  subgroup  in  the  ICE/CED-3 
family  of  cysteine  proteases)  (43)  rises  sharply,  fol¬ 
lowed  by  fragmentation  of  the  added  sperm  nuclei  (34). 
Both  the  induction  of  caspases  and  the  fragmentation 
of  the  added  nuclei  in  the  egg  extract  can  be  suppressed 
by  the  addition  of  Bcl2,  a  protein  that  inhibits  apopto¬ 
sis  in  vivo  (34,  36).  Recent  work  has  shown  that  apo¬ 
ptotic  changes  in  egg  extract  are  triggered,  through 
unknown  cytosolic  factors,  by  the  release  of  c3h;ochrome 
c  from  the  mitochondrial  fraction  of  the  extract  (43). 
Apparently  Bcl2  inhibits  apoptosis  by  blocking  the  re¬ 
lease  of  c3d;ochrome  c  from  mitochondria  (43,  44). 

We  wished  to  determine  whether  the  apoptosis-like 
events  in  egg  extracts  would  be  perturbed  by  dipeptide 
inhibitors  of  the  N-end  rule  pathway.  In  preliminary 
experiments,  we  found  that  the  times  of  apoptotic 
changes  in  egg  extracts  varied  considerably  between 
independent  extract  preparations.  Specifically,  the 
sharp  rise  of  the  DEVD-specific  protease  activity  and 
the  disintegration  of  the  added  sperm  nuclei  could  oc¬ 
cur  as  early  as  2  h  after  the  start  of  incubation  or  as 


A  Minutes 


no  inhibitors 


B  Minutes 


no  inhibitors 


Arg-p-Ala  Arg-p-Ala 


> 


Trp-Ala  Trp-Ala 

FIG.  5.  Electrophoretic  analysis  of  N-end  rule  substrates  in  egg 
extracts  in  the  presence  of  dipeptides.  (A)  ^*S-labeled,  purified  Ub- 
Arg-DHFR  was  added  to  a  CSF-arrested  Xenopus  egg  extract  in  the 
absence  or  the  presence  of  dipeptides  Arg- J3- Ala  or  Trp-Ala  (at  10 
mM,  together  with  bestatin  at  50  /xg/ml).  After  incubation  at  23*C  for 
the  indicated  times,  the  samples  were  analyzed  by  SDS—PAGE,  and 
the  labeled  proteins  were  detected  by  autoradiography.  Open  arrow¬ 
heads  indicate  the  bands  of  the  initial,  34-kDa  Ub-Arg-DHFR. 
Closed  arrowheads  indicate  the  bands  of  deubiquitylated,  26-kDa 
Arg-DHFR.  (B)  Same  as  in  A  but  with  Ub-Phe-DHFR. 
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FIG.  6.  Apoptotic  changes  in  Xenopus  egg  extracts,  assayed  by 
measuring  the  DEVD-specific  protease  activity.  (A)  A  preparation  of 
the  CSF-arrested  egg  extract  was  divided  into  three  samples.  The 
extracts  were  left  untreated  (□)  or  were  activated  by  either  0.4  mM 
CaClo  (O)  or  0.4  mM  CaClg  in  the  presence  of  0.1  mg/ml  cyclohexi* 
midelO).  The  samples  were  incubated  at  room  temperature  for  up  to 
5  h.  The  DEVD-specific  protease  activity  in  the  extracts  was  deter¬ 
mined  using  the  fluorogenic  substrate  Ac-DEVD-AMC,  (B)  Samples 
of  interphase-arrested  egg  extract  were  incubated  either  in  the  ab¬ 
sence  (O)  or  in  the  presence  of  the  following  reagents:  Lys-Ala  at  20 
mM  (EB);  Ala-Lys  at  20  mM  (O);  Trp-Ala  at  20  mM  (X);  Trp-Ala  plus 
Lys-Ala.  each  at  10  mM  (O);  Ac-DEVD-CHO  (an  inhibitor  of  DEVD 
proteases,  at  10  mM)  (A).  The  dipeptides  were  added  together  with 
bestatin  (50  /ig/ml).  The  DEVD-specific  protease  activity  was  mea¬ 
sured  as  in  A. 


late  as  5  h  (data  not  shown).  This  variability,  reported 
by  others  as  well  (43),  is  likely  to  result  in  part  from  the 
variability  of  the  tuning  of  upstream  events  that  trig¬ 
ger  the  begiiming  of  apoptosis  in  the  Xenopus  egg  ex¬ 
tract.  In  addition,  the  timing  of  apoptotic  changes  de¬ 
pended  on  the  state  of  the  extracts:  when  a  single 
preparation  was  split  into  three  samples,  and  was  used 
as  CSF-arrested,  cycling,  and  interphase-arrested  ex¬ 
tracts,  the  increase  of  the  DEVD-specific  protease  ac¬ 
tivity  was  observed  first  in  the  interphase-arrested 
extract,  then  in  the  CSF-arrested  one,  and  later  in  the 
cycling  extract  (Fig.  6A).  This  pattern  was  reproducible 
between  different  preparations  of  the  extract,  even 
though  the  absolute  timing  of  the  rise  of  the  DEVD- 
specific  protease  activity  varied  significantly  between 
preparations  (data  not  shown). 

We  asked  whether  selective  inhibition  of  the  N-end 
rule  pathway  would  affect  the  nature  or  kinetics  of  the 
apoptotic  changes.  Dipeptides  that  had  been  shown  to 
be  efficacious  inhibitors  of  the  N-end  irule  pathway 
(Figs.  2  and  3A)  were  added  to  the  interphase-arrested 
and  cycling  extracts  at  the  beginning  of  incubation.  We 
found  that  Lys-Ala  and  Arg-^-Ala,  containing  type  1 
primary  destabilizing  N-terminal  residues,  and  Ala- 
L3rs,  containing  a  type  3  destabilizing  residue,  did  not 
alter  the  timing  of  DEVD-specific  protease  induction 
(Fig.  6B).  In  contrast,  Trp-Ala,  which  bears  a  type  2 
destabilizing  N-terminal  residue,  delayed  both  the  rise 
of  the  DEVD-specific  protease  activity  and  the  disinte¬ 
gration  of  sperm  nuclei  by  --1  h  in  8  out  of  12  indepen¬ 


dent  experiments  (Fig.  6B  and  data  not  shown).  The 
-1  h  delay  of  apoptotic  changes  by  Trp-Ala  was  signif¬ 
icant  but  weak  in  comparison  to  the  effect  of  Ac-DEVD- 
CHO,  an  inhibitor  of  DEVD-specific  proteases,  which 
delayed  apoptosis  by  at  least  2  h  (data  not  shown). 
Interestingly,  the  proteasome  inhibitor  5-iodo-4-hy- 
droxyl-3-nitrophenylacetyl-leucinyl-leucinyl-leucine- 

vinyl  sulfone  (NIP-L3VS)  (45),  which  mhibited  the  deg¬ 
radation  of  Phe-DHFR  in  egg  extracts,  also  delayed 
apoptosis  in  the  CSF-arrested  extract  by  h  (data 
not  shown),  similarly  to  the  effect  of  Trp-Ala  (Fig.  6B). 

DISCUSSION 

We  report  the  following  results: 

(DA  whole-cell  extract  ofX  laevis  eggs  contains  the 
N-end  rule  pathway,  as  determined  by  monitoring  the 
metabolic  fates  of  purified,  ^^S-labeled  Ub-X-DHFR 
test  proteins  added  to  the  extract.  Among  the  cells  of 
mxilticellular  organisms,  the  N-end  rule  pathway  has 
been  identified  and  characterized  in  rabbit  reticulo- 
C3rtes  (18)  and  L  cells,  a  mouse  cell  line  (19, 26),  but  not, 
until  now,  in  germ  line  cells. 

(2)  The  partial  N-end  rule  defined  in  egg  extracts 
was  found  to  be  identical  to  the  corresponding  subsets 
of  the  N-end  rules  in  rabbit  reticulocytes  and  mouse 
fibroblasts.  In  particular,  we  showed  that  in  egg  ex¬ 
tracts  the  destabilizing  activity  of  N-terminal  Asn, 
Asp,  and  Cys  requires  the  presence  of  RNA  (presum¬ 
ably  tRNA),  strongly  suggesting  that  these  are  second¬ 
ary  (Asp  and  Cys)  and  tertiary  (Asn)  residues  in  the 
egg^s  N-end  rule  pathway,  as  they  have  been  shown  to 
be  in  other  metazoan  cells  (see  introduction  for  the 
terminology)  (7). 

(3)  The  activity  of  the  N-end  rule  pathway  did  not 
change  significantly  upon  shifts  of  the  CSF-arrested  egg 
extract  to  the  cycling  or  the  interphase-arrested  state. 

(4)  Dipeptides  bearing  destabilizing  N-terminal  res¬ 
idues  selectively  and  efficiently  inhibited  the  degrada¬ 
tion  of  N-end  rule  substrates  in  egg  extract.  In  partic¬ 
ular,  dipeptides  bearing  the  basic  (type  1)  primary 
destabilizing  N-terminal  residues  inhibited  the  degra¬ 
dation  of  N-end  rule  substrates  bearing  t3q)e  1  but  not 
t3rpe  2  N-terminal  residues.  A  converse  inhibition  pat¬ 
tern  was  observed  with  dipeptides  bearing  bulky  hy¬ 
drophobic  (type  2)  N-terminal  residues. 

(5)  The  onset  of  apoptosis-like  changes  in  egg  extract 
(fragmentation  of  the  added  sperm  nuclei  and  abrupt 
increase  of  the  DEVD-specific  protease  activity  (34, 
43))  was  found  to  be  delayed  by  -1  h  in  the  presence  of 
dipeptide  Trp-Ala,  which  bears  a  type  2  destabilizing 
N-terminal  residue,  but  not  in  the  presence  of  dipep¬ 
tides  bearing  type  1  destabilizing  N-terminal  residues. 

The  N-end  rule  pathway  has  been  identified  in  all 
organisms  examined,  from  mammals  to  fimgi  and  bac- 
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teria,  but  the  understanding  of  its  functions  remains 
incomplete  (6,  7).  The  first  clear  example  of  a  physio¬ 
logical  function  of  the  N-end  rule  pathway  is  its  re¬ 
cently  identified  role  in  controlling  the  import  of  pep¬ 
tides  in  S.  cerevisiae  through  the  degradation  of  the 
transcriptional  repressor  Cup9p  that  down-regulates  a 
peptide  transporter  (25,  46).  It  has  also  been  reported 
that  dipeptides  bearing  destabilizing  N-terminal  resi¬ 
dues  spedfically  inhibit  difierentiation  of  rat  pheochro- 
mocytoma  PC12  cells  (47),  the  neurite  outgrowth  in 
amphibian  neuroepitheUal  cells  (48),  and  limb  regen¬ 
eration  in  the  newt  (49),  suggesting  a  role  for  the  N-end 
rule  pathway  in  these  biological  processes.  Unfortu¬ 
nately,  none  of  these  studies  (47-49)  included  control 
experiments  to  verify  that  the  added  dipeptides  actu¬ 
ally  inhibited  degradation  of  a  reporter  N-end  rule 
substrate  in  the  target  cells.  Thus,  it  remains  to  be 
determined  whether  the  observed  biological  effects  of 
dipeptides  (47-49)  were  caiised  by  specific  inhibition  of 
the  N-end  rule  pathway  in  the  target  cells  or  resulted 
fi-om  other  effects  of  the  added  dipeptides. 

Apart  fi:om  the  interest  in  determining  whether  the 
N-end  rule  pathway  is  present  in  germ  line  cells  such 
as  Xenopus  eggs,  one  reason  for  initiating  this  work 
was  the  opportunity  to  test,  through  selective  inhibi¬ 
tion  of  the  N-end  rule  pathway  in  egg  extracts,  whether 
this  pathway  plays  a  role  in  apoptosis.  Several  lines  of 
evidence  suggest  a  role  for  the  Ub/proteasome  system, 
of  which  the  N-end  rule  pathway  is  a  part,  in  regulat¬ 
ing  apoptosis  (50-55).  In  particular,  cell-penetrating 
proteasome  inhibitors  were  reported  to  partially  pro¬ 
tect  nonproliferating  cells  such  as  thymocytes  and 
growth  factor-deprived  sympathetic  neurons  firom  ap¬ 
optosis  (52,  53).  In  contrast,  the  same  proteasome  in¬ 
hibitors  were  reported  to  induce  apoptosis  in  mitoti- 
cally  active  cells  (51,  54).  An  apoptosis-like  process, 
induced  by  the  stress  of  starvation  and  mediated  by 
proteolysis,  was  also  identified  in  E.  coli  (56).  A  key 
regulator  of  E.  coli  apoptosis  is  a  short-lived  protein 
MazE,  which  is  degraded  by  ClpAP,  a  proteasome-like 
ATP-dependent  protease  t^t  targets  N-end  rule  sub¬ 
strates  in  E.  coli  (38).  These  and  other  considerations, 
iTioinfhng  the  possibility  that  cleavages  by  caspases  can 
produce  N-end  rule  substrates,  led  to  ^e  suggestion 
that  the  N-end  rule  pathway  may  function  in  the  con¬ 
trol  of  apoptosis  (7). 

Specific  dipeptides,  which  have  been  shown  to  act  as 
selective  inhibitors  of  the  N-end  rule  pathway  in  re- 
ticuloc}ie  extracts  (19, 27)  and  intact  S.  cerevisiae  cells 
(28),  were  found  to  be  largely  ineffective  with  several 
lines  of  mammalian  cells^.  Xenopus  egg  extracts,  in 
contrast  to  reticuloc3d;e  extracts,  have  been  shown  to 
undergo  apoptosis-like  changes  (34,  36,  43),  hence  the 
choice  of  these  extracts  for  the  present  study.  We  found 
that  dipeptides  bearing  either  type  1  (Arg,  Lys)  or  type 
3  (Ala)  Hpstahihmng  N-terminal  residues  had  no  effect 


on  the  apoptotic  changes  in  the  extracts.  In  contrast, 
Trp-Ala,  which  bears  a  type  2  destabilizing  N-terminal 
residue,  inhibited  apoptosis  in  a  number  of  indepen¬ 
dent  experiments  with  different  preparations  of  egg 
extracts;  in  a  minority  of  tests  the  effect  was  not  ob¬ 
served.  Tyr-His,  a  dipeptide  bearing  another  type  2 
destabilizing  N-terminal  residue,  strongly  inhibited 
the  degradation  of  an  N-end  rule  substrate  such  as 
Phe-DHFR  in  the  extract,  but  was  significantly  less 
efficacious  thsm  Trp-Ala  in  its  effect  on  the  timing  of 
apoptotic  changes  in  the  same  extract.  Taken  together, 
the  results  of  experiments  with  dipeptides  as  inhibitors 
of  apoptosis  indicate  that  some  inhibitors  of  the  N-end 
rule  pathway,  notably  Trp-Ala,  can  cause  a  significant 
delay  of  apoptosis  in  the  extract.  A  definitive  test  and 
further  analysis  of  the  thus  suggested  function  of  the 
N-end  rule  pathway  in  apoptosis  will  require  mutants 
that  eliminate  this  pathway  in  a  multicell\ilar  organ¬ 
ism  without  perturbing  the  rest  of  the  Ub  system. 
Construction  of  a  mouse  mutant  that  lacks  the  entire 
N-end  rule  pathway  (through  a  deletion  of  the  Ubrl 
gene  that  encodes  N-recognin,  the  main  recognition 
component  of  this  proteolytic  system)  is  under  way^. 

In  summary,  we  established  the  existence  of  the 
N-end  rule  pathway  in  germ  line  cells  such  as  Xenopus 
eggs,  showed  that  dipeptides  are  efficacious  inhibitors 
of  the  N-end  rule  pathway  in  egg  extracts,  and  exam¬ 
ined  the  effects  of  dipeptides  on  apoptotic  changes  in 
these  extracts. 

ACaCNOWLEDGMENTS 

We  are  grateful  to  W,  Dunphy  for  helpful  discussions  and  the 
access  to  his  laboratory's  Xenopus  facility,  and  to  M.  Bogyo  and  H. 
Ploegh  for  a  gift  of  the  NIP-L3VS  proteasome  inhibitor.  We  thank  E. 
Smirnova  and  members  of  the  Varshavsky  lab,  especially  L.  Peck,  G. 
Turner,  and  A.  Webster,  for  their  suggestions  and  for  the  comments 
on  the  manuscript.  D.P.  is  a  Fellow  of  the  Leukemia  Society  of 
America.  This  work  was  supported  by  grants  to  A.V,  from  the  Na¬ 
tional  Institutes  of  Health  (DK39520  and  GM31530). 

REFERENCES 

1.  Haas,  A.  J.,  and  Siepman,  T.  J.  (1997)  FASEB  J.  11, 1257-1268. 

2.  Varshavsky,  A.  (1997)  Trends  Biockem.  Sci.  22,  383-387. 

3.  Hochstrasser,  M.  (1996)  Annu.  Rev.  Genet  30, 405-439. 

4.  King,  R.  W.,  Deshaies,  R.  J,,  Peters,  J.  M.,  and  Kirschner,  M.  W. 
(1996)  Science  274,  1652-1659. 

5.  Varshavsky,  A.  (1991)  Cell  64,  13-15. 

6.  Bachmair,  A.,  Finley,  D.,  and  Varshavsky,  A.  (1986)  Science  234, 
179-186. 

7.  Varshavsky,  A.  (1996)  Proc.  NatL  Acad.  ScL  USA  93,  12142- 
12149. 

8.  Bachmair,  A.,  and  Varshavsky,  A.  (1989)  Cell  56,  1019-1032. 

9.  Johnson,  E.  S.,  Gonda,  D.  K.,  and  Varshavsky,  A.  (1990)  Nature 
3^,  287-291. 


^  Y.  T.  Kwon  and  A.  Varshavsky,  unpublished  data. 


THE  N-END  RULE  IN  XENOPUS 


325 


10  Hill  C.  P.,  Johnston,  N.  L.,  and  Cohen,  R.  E.  (1993)  Proc.  Natl. 

'  '  Acad!  Sci.  USA  90,  4136-4140. 

11.  Chau,  V.,  Tobias,  J.  W.,  Bachmair,  A.,  Marriott,  D.,  Ecker,  D.  J., 
Gonda,  D.  K,  and  Varshavsky,  A.  (1989)  Science  243, 1576-1583. 

12.  Pickart,  C.  M.  (1997)  FASEB  J.  11, 1055-1066. 

13.  Wilkinson,  K  D.  (1997)  FASEB  J.  11,  1245-1256. 

14.  Hershko,  A.,  and  Ciechanover,  A.  (1992)  Annu.  Reo.  Biochem.  61, 
761-807. 

15.  Finley,  D.,  and  Chau,  V.  (1991)  Annu.  Rev.  Cell  Biol.  7,  25-69. 

16.  Couz,  O.,  Tanaka,  K.,  and  Goldberg,  A.  L.  (1996)  Annu.  Rev. 
Biochem.  65,  801-817. 

17.  Hilt,  W.,  and  Wolf,  D.  H.  (1996)  Trends.  Biochem.  Sci.  21, 96-102. 

18.  Lupas,  A.,  Flanagan,  J.  M.,  Tamura,  T.,  and  Baumeister,  W. 
(1997)  Trends  Biochem.  Sci.  22,  399-404. 

19.  (jonda,  D.  K,  Bachmair,  A.,  Wiinning,  L,  Tobias.  J.  W.,  Lane,  W.  S., 
and  Varshavsky,  A.  (1989)  J.  BioL  Chem  264, 16700—16712. 

20.  Bpl”  E.,  Choder,  M.,  Chen,  W.,  Varshavsky,  A.,  and  Goffeau,  A. 
(1990)  J.  Biol.  Chem  265,  7464-7471. 

21.  Baker,  R.  T.,  and  Varshavsky,  A.  (1995)  J.  Biol.  Chem.  270, 
12065-12074. 

22.  B.,  Wiinning,  I.,  and  Varshavsky,  A.  (1990)  EMBO  J.  9, 
3179-3189. 

23.  deGroot,  R.  J..  Rumenapf,  T.,  Kuhn,  R.  J.,  and  Strauss,  J.  H. 
(1991)  Proc.  Natl.  Acad.  Sci.  USA  88,  8967-8971. 

24.  Madura,  K.,  and  Varshavsky,  A.  (1994)  Science  265, 1454-1458. 

25.  Byrd,  C.,  Turner,  G.  C.,  and  Varshavsky,  A.  (1998)  EMBO  J.  17, 
269-277. 

26.  Levy,  F.,  Johnsson,  N.,  Rumenapf,  T.,  and  Varshavsky,  A.  (1996) 
Proc.  Natl.  Acad.  Sci.  USA  93, 4907—4912. 

27.  Rpiss,  Y.,  KaiTTi,  D.,  and  Hershko,  A.  (1988)  J.  Biol.  Chem.  263, 
2693-269. 

28.  Baker,  R.  T.,  and  Varshavsky,  A.  (1991)  Proc.  Natl.  Acad.  Sci. 
USA  87,  2374-2378. 

29.  Dunphy,  W.  G.  (1994)  Trends  Cell  Biology  4,  202-207. 

30.  Johnston,  J.  A.,  Johnson,  E.  S.,  Waller,  P.  R.  H.,  and  Varshavsky, 
A.  (1995)  J.  Biol.  Chem.  270,  8172-8178. 

31  Ausubel,  F.  M.,  Brent,  R.,  Kingston,  R.  E.,  Moore,  D.  D.,  Smith, 
J.  A.,  Seidman,  J.  G.,  and  Struhl,  K  Eds.  (1996)  Current  Proto¬ 
cols  in  Mnleciilar  Biology,  Wiley-Intersdence,  New  York. 

32.  Murray,  A.  W.  (1991)  Methods  Cell.  Bud.  36,  581-605. 

33.  Grigoryev,  S.,  Stewart,  A.  E.,  Kwon,  Y.  T.,  Arfin,  S.  M.,  Brad¬ 
shaw,  R.  A.,  Jenkins,  N.  A.,  Copeland,  N.  J.,  and  Varshavsky,  A. 
(1996)  e/.  Biol.  Chem.  271,  28521—28532. 

34.  Newmeyer,  D.  D.,  Farschon,  D.  M.,  and  Reed,  J.  C.  (1994)  Cell 
79,353-364. 

35.  Nicholson,  D.  W.,  Ali,  A.,  Thomberry,  N.  A.,  Vaillancourt,  J.  P., 
Ding,  C.  K,  Gallant,  M.,  Gareau,  Y.,  GrifiSn,  P.  R.,  Labelle,  M., 


T.a^ahnilc  Y.  A..  Munday,  N.  A.,  Raju,  S.  M.,  Smulson,  M.  E., 
Yamin,  T.-T.,  Yu,  V.  L.,  and  Miller,  D.  K.  (1995)  Nature  376, 
37-43. 

36.  CosuUch,  S.  C.,  Green,  S.,  and  Clarke,  P.  R.  (1996)  Curr.  Biol.  6, 
353-364. 

37.  Johnson,  E.  S.,  Ma,  P.  C.  M.,  Ota.  I.  M.,  and  Varshavsky,  A. 
(1995)  J.  BioL  Chem.  270,  17442-17456. 

38.  Tobias,  J.  W.,  Shrader,  T.  E.,  Rocap,  G.,  and  Varshavsky,  A. 
(1991)  Science  254, 1374-1377. 

39.  Sagata.  N.  (1996)  Trends  Cell  Biol.  6,  22-28. 

40.  Furuno,  N.,  Ogawa.  Y.,  Iwashita,  J.,  Nakajo,  N.,  and  Sagata,  N. 
(1997)  EMBO  J.  16,  3860-3865. 

41.  Evans,  E.  K.,  Lu.  W..  Strum,  S.  L.,  Mayer,  B.  J.,  and  Kombluth. 
S.  (1997)  EMBO  J.  16,  230-241. 

42.  Kluck,  R.  M.,  Bossy-Wetzel.  E.,  Green,  D.  R.,  and  Newmeyer, 
D.  D.  (1997)  Science  275,  1132-1136. 

43.  Kluck,  R.  M.,  Martin,  S.  J.,  Hofihnan,  B.  M.,  Zhou,  J.  S.,  Green, 
D.  R.,  and  Newmeyer,  D.  D.  (1997)  EMBO  J.  16,  4639-4649. 

44.  Villfl,  P.,  tCaiifinann.  S.  H.,  and  Eamshaw,  W.  C.  (1997)  Trends 
Biochem  Sci.  22,  388—393. 

45.  Bogyo,  M.,  McMaster,  J.  S.,  Gaczynska.  M.,  Tortorella,  D.,  Gold¬ 
berg,  A.  L.,  and  Ploegh,  H.  (1997)  Proc.  Natl.  Acad.  Sci.  USA  94, 
6629-6634. 

46.  Alagramam,  K,  Naider,  F.,  and  Becker,  J.  M.  (1995)  Mol.  Mi* 
erobiol.  15,  225-234. 

47.  Hondermarck,  H.,  Sy,  J.,  Bradshaw,  R.  A.,  and  Arfin,  S.  M. 
(1992)  Biochem.  Biophys.  Res.  Commun.  30,  280-288. 

48.  J.  P.,  Bradshaw,  R.  A.,  Boilly,  B.,  and  Hondermarck, 
H.  (1996)  Int.  J.  Dev.  Biol.  40,  609-611. 

49.  Taban,  C.  H..  Hondermarck,  H.,  Bradshaw,  R.  A.,  and  Boilly,  B. 
(1996)  Experentia  52,  865-870. 

50.  Delic,  J.,  Morange,  M.,  and  Magdelenat,  H.  (1993)  Mol.  Cell. 
Biol.  13,  4875-4883. 

51.  Imajoh-Ohmi,  S..  Kawaguchi.  T.,  Sugiyama,  S.,  Tanaka.  K. 
Omura,  S.,  and  Kikuchi,  H.  (1995)  Biochem.  Biophys.  Res.  Com¬ 
mun.  217,  1070-1077. 

52.  Grimm,  L.  M.,  Goldberg,  A.  L..  Poirier,  G.  G.,  Schwartz,  L.  M., 
gnH  Osborne,  B.  A.  (1996)  EMBO  J.  15,  3835—3844. 

53.  Sadoul,  R.,  Fernandez,  P.-A.,  Quiquerez,  A.-L.,  Martinou.  I., 
Main,  M.,  Schrbter,  M.,  Becherer,  J.  D.,  Irmler,  M.,  Tschopp,  J., 
and  Martinou,  J.-C.  (1996)  EMBO  J.  15,  3845—3852. 

54.  Drexler,  H.  C.  A.  (1997)  Proc.  Natl.  Acad.  Sci.  USA  94,  855-860. 

55.  Lopes,  U.  G.,  Erhardt,  P..  Yao,  R.,  and  Cooper,  G.  M.  J.  Biol. 
Chem  272, 12893-12896. 

56.  Aizenman,  E.,  Engelberg-Kulka,  H.,  and  Glaser,  G.  (1996)  Proc. 
Natl.  Acad.  Sci.  USA  93,  6059-6063. 


The  EMBO  Journat  Vol.17  No.1  pp.269-277,  1998 


The  N-end  rule  pathway  controls  the  import  of 
peptides  through  degradation  of  a  transcriptional 
repressor 


Christopher  Byrd,  Glenn  C.Turner  and 
Alexander  Varshavsky'* 

Division  of  Biology,  California  Institute  of  Technology, 

1200  East  California  Boulevard,  Pasadena,  CA  91125,  USA 

’Corresponding  author 
e-mail:  avarsh@cco.caltech.edu 

Ubiquitin-dependent  proteolytic  systems  underlie 
many  processes,  including  the  cell  cycle,  cell  di^erenti- 
ation  and  responses  to  stress.  One  such  system  is  the 
N-end  rule  pathway,  which  targets  proteins  bearing 
destabilizing  N-terminal  residues.  Here  we  report  that 
Ubrlp,  the  main  recognition  component  of  this  path¬ 
way,  regulates  peptide  import  in  the  yeast  Saccharo- 
myces  cerevisiae  through  degradation  of  Cup9p,  a  35 
kDa  homeodomain  protein.  Cup9p  was  identified  using 
a  screen  for  mutants  that  bypass  the  previously 
observed  requirement  for  Ubrlp  in  peptide  import. 
We  show  that  Cup9p  is  a  short-lived  protein  {ti/2 
~5  min)  whose  degradation  requires  Ubrlp.  Cup9p 
acts  as  a  repressor  of  PTR2^  a  gene  encoding  the 
transmembrane  peptide  transporter.  In  contrast  to 
engineered  N-end  rule  substrates,  which  are  recognized 
by  Ubrlp  through  their  destabilizing  N-terminal 
residues,  Cup9p  is  targeted  by  Ubrlp  through  an 
interna]  degradation  signal.  The  Ubrlp~Cup9p-Ptr2p 
circuit  is  the  first  example  of  a  physiological  process 
controlled  by  the  N-end  rule  pathway.  An  earlier  study 
identified  Cup9p  as  a  protein  required  for  an  aspect 
of  resistance  to  copper  toxicity  in  S.cerevisiae.  Thus, 
one  physiological  substrate  of  the  N-end  rule  pathway 
functions  as  both  a  repressor  of  peptide  import  and  a 
regulator  of  copper  homeostasis. 

Keywords:  CUP9/N-end  rule/peptide  import/proteolysis/ 
PTRIIUBRJ 


Introduction 

Many  regulatory  proteins  are  short-lived  in  vivo  (Schwob 
et  al,  1994;  King  et  al,  1996;  Varshavsky,  1996).  This 
metabolic  instability  makes  possible  rapid  adjustment 
of  the  protein’s  concentration  (or  subunit  composition) 
through  changes  in  the  rates  of  its  synthesis  or  degradation. 
Protein  degradation  plays  a  role  in  a  multitude  of  processes, 
including  cell  growdi,  division,  differentiation  and 
responses  to  stress.  In  eukaryotes,  a  large  fraction  of 
intracellular  proteolysis  is  mediated  by  the  ubiquitin  sys¬ 
tem.  Ubiquitin  (Ub)  is  a  76-residue  protein  whose  covalent 
conjugation  to  other  proteins  marks  them  for  processive 
degradation  by  the  26S  proteasome — an  ATP-dependent, 
multisubunit  protease  (Jentsch  and  Schlenker,  1995;  Hilt 
and  Wolf,  1996;  Hochstrasser,  1996;  Rubin  et  aL,  1997). 


Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals  (degrons).  One  of  the 
degradation  signals  recognized  by  the  ubiquitin  system  is 
called  the  N-degron.  It  comprises  two  essential  determin¬ 
ants:  a  destabilizing  N-terminal  residue  and  an  internal 
lysine  of  a  substrate  (Bachmair  et  aL,  1986;  Varshavsky, 
1996).  The  Lys  residue  is  the  site  of  formation  of  a 
substrate-linked  multiubiquitin  chain  (Bachmair  and 
Varshavsky,  1989;  Chau  et  aL,  1989).  A  set  of  N-degrons 
bearing  different  N-terminal  residues  that  are  destabilizing 
in  a  given  cell  type  yields  a  rule,  called  the  N-end  rule, 
which  relates  the  in  vivo  half-life  of  a  protein  to  the 
identity  of  its  N-terminal  residue.  Similar  but  distinct 
versions  of  the  N-end  rule  operate  in  all  organisms 
examined,  from  mammals  to  fungi  and  bacteria 
(Varshavsky,  1996). 

The  N-end  rule  pathway  is  organized  hierarchically.  In 
eukaryotes  such  as  Saccharomyces  cerevisiae,  Asn  and 
Gin  are  tertiary  destabilizing  N-terminal  residues  in  that 
they  function  through  their  conversion,  by  the  NTAl- 
encoded  N-terminal  amidase  (Nt-amidase),  into  the 
secondary  destabilizing  residues  Asp  and  Glu  (Baker  and 
Varshavsky,  1995).  Secondary  residues,  in  turn,  function 
through  their  conjugation  to  Arg  by  the  ATE7-encoded 
Arg-tRNA-protein  transferase  (R-transferase)  (Balzi  etaL, 
1990).  Arg  is  one  of  several  primary  destabilizing 
N-terminal  residues  which  are  bound  directly  by 
N-recognin,  a  225  kDa  E3  protein  encoded  by  the  UBRl 
gene  (Bartel  et  aL,  1990).  Ubrlp,  together  with  the 
associated  ubiquitin-conjugating  (E2)  enzyme  Ubc2p, 
mediates  the  formation  of  a  substrate-linked  multiubiquitin 
chain  (Varshavsky,  1996). 

The  N-end  rule  pathway  was  first  encountered  in  experi¬ 
ments  that  explored,  in  Sxerevisiae,  the  metabolic  fate  of 
a  fusion  between  Ub  and  a  reporter  such  as  Escherichia 
coli  p-galactosidase  (P-gal)  (Bachmair  1986).  While 
such  engineered  N-end  rule  substrates  have  been  extens¬ 
ively  characterized  (Varshavsky,  1996),  little  is  known 
about  their  physiological  counterparts.  The  few  identified 
so  far  include  the  GM7-encoded  Ga  subunit  of  the 
S,  cerevisiae  heterotrimeric  G  protein,  which  mediates  the 
pheromone  response  in  this  fungus,  and  RNA  polymerases 
of  alphavinises  whose  hosts  include  mammalian  and  insect 
cells  (de  Groot  et  aL,  1991;  Madura  and  Varshavsky, 

1994) .  Physiological  functions  of  the  instability  of  these 
proteins  remain  to  be  understood  (Varshavsky,  1996). 
Inactivation  of  the  N-end  rule  pathway  in  S. cerevisiae — 
through  deletion  of  the  UBRl  gene — results  in  cells  which 
grow  slightly  slower  than  their  wild-type  counterparts, 
and  are  impaired  in  sporulation  (increased  frequency  of 
asci  containing  fewer  than  four  spores),  but  otherwise 
appear  to  be  normal  (Bartel  et  aL,  1990). 

Recently,  Becker  and  colleagues  (Alagramam  et  aL, 

1995)  have  reported  that  uhrlA,  cells  are  deficient  in  the 
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Fig.  1.  The  import  of  peptides  is  decreased  in  the  absence  of  Ubc2p  and  virtually  abolished  in  the  absence  of  Ubc2p  and  Ubc4p.  Saccharomyces 
cerevisiae  mutants  deficient  in  one  or  more  ubiquitin-conjugating  (E2)  enzymes  or  other  components  of  the  ubiquitin  system  were  tested  for  their 
ability  to  import  peptides,  using  the  halo  assay  and  a  toxic  dipeptide  L-leucyl-L-ethionine  (Leu-Eth)  (see  Materials  and  methods).  *  +  \  ‘±’  and 
denote,  respectively,  the  apparently  wild-type,  significantly  reduced,  and  undetectable  levels  of  peptide  import.  A  superscript  T  refers  to  strains  that 
carried  a  ts  allele  of  an  essential  E2  enzyme,  either  Cdc34p  (Ubc3p)  or  Ubc9p;  the  30°C  temperature  of  the  test  was  semi-permissive  for  these 
strains.  (A)  Summary  of  the  results.  (B-F)  Examples  of  the  actual  halo  assays,  with  wild-type  (B),  p/r2A  (C).  uhrJA  (D),  uhc2A  (E),  uhc4A  (F)  and 
uhc2A  uhc4A  (G)  strains  of  S.cerevisiae.  Elimination  of  some  E2  enzymes  in  the  ubc2A  background  restored  halo  formation,  presumably  because 
such  strains  were  growth-impaired  in  a  way  that  made  them  hypersensitive  to  the  toxicity  of  eihionine.  By  contrast,  although  uhc2A  iihc4A  cells 
were  also  growth-impaired,  they  were  import-defective  and  therefore  grew  in  the  immediate  vicinity  of  the  filter. 


import  of  di-  and  tripeptides,  suggesting  that  this  process, 
which  is  universal  among  living  cells,  requires  the  N-end 
rule  pathway.  In  the  present  work,  we  identified  the 
underlying  regulatory  mechanism  and  discovered  a  new 
physiological  substrate  of  the  N-end  rule  pathway,  the 
homeodomain  protein  Cup9p.  This  short-lived  protein 
is  targeted  for  degradation  by  Ubrlp,  and  acts  as  a 
transcriptional  repressor  of  PTR2,  a  gene  that  encodes  a 
transmembrane  peptide  transporter.  The  Ubrlp-Cup9p- 
Ptr2p  circuit  is  the  first  example  of  a  physiological  process 
controlled  by  the  N-end  rule  pathway. 

Results 

The  involvement  of  Ubc2p  and  Ubc4p  E2  enzymes 
in  the  control  of  peptide  import 

We  began  by  asking  whether  components  of  the  N-end 
rule  pathway  other  than  Ubrlp  were  also  necessary  for 
the  import  of  peptides.  Previous  work  has  shown  that 
Ubc2p,  one  of  13  ubiquitin-conjugating  (E2)  enzymes  of 
Sxerevisiae,  is  required  for  the  degradation  of  engineered 
N-end  rule  substrates,  and  is  physically  associated  with 
the  E3  protein  Ubrlp  (N-recognin)  (Jentsch,  1992;  Madura 
et  ai,  1993). 

To  test  for  the  ability  of  Sxerevisiae  to  import  peptides, 
we  used  a  halo  assay,  in  which  a  filter  soaked  in  the  toxic 
dipeptide  L-leucyl-L-ethionine  (Leu-Eth)  is  placed  on  a 
plate,  inhibiting  the  growth  of  import-competent  cells  near 
the  filter.  By  this  test,  the  elimination  of  UBC2  impaired, 
but  did  not  abolish,  the  import  of  peptides  (Figure  lA,  B 
and  E).  To  determine  which  of  the  other  E2  enzymes,  if 


any,  were  required  for  the  residual  peptide  import  observed 
in  wZ7c2A  cells,  a  number  of  single  and  multiple  mutants 
in  UBC  genes  were  examined  (Figure  1).  We  found 
that  the  elimination  of  both  UBC2  and  UBC4  virtually 
abolished  the  import  of  peptides  (Figure  lA,  B  and 
E-G).  Elimination  of  Ubc4p,  one  of  the  more  abundant 
E2  enzymes  (Bachmair  et  al,  1986;  Jentsch,  1992),  had 
previously  been  noticed  to  decrease  slightly  the  activity 
of  the  N-end  rule  pathway  (Bartel,  1990).  Cells  lacking 
either  Ntalp  or  Atelp — the  ‘upstream’  components  of  this 
pathway — were  also  examined  and  found  unimpaired  in 
the  import  of  peptides,  in  contrast  to  iibrl/A  and  ubc2/A 
ubc4A  cells  (Figure  lA). 

identification  of  Cup9p  as  a  negative  regulator  of 
peptide  import 

Previous  work  (Alagramam  et  al,  1995)  has  shown  that 
deletion  of  UBRl  greatly  reduces  the  level  of  PTR2  mRNA, 
which  encodes  the  transmembrane  peptide  transporter.  This 
result,  and  our  observation  that  peptide  import  requires 
the  presence  of  at  least  one  of  two  specific  ubiquitin- 
conjugating  enzymes  (Figure  1),  suggested  a  model  in 
which  expression  of  the  Ptr2p  transporter  is  regulated  by 
a  short-lived  repressor  that  is  degraded  by  the  N-end  rule 
pathway.  In  cells  lacking  UBRl,  the  repressor  would  be 
expected  to  accumulate,  thereby  blocking  peptide  import. 
One  prediction  of  this  model  is  that  inactivation  of  this 
repressor  would  bypass  the  requirement  for  Ubrlp  in 
peptide  import.  A  screen  for  such  ‘bypass’  mutations  (see 
Materials  and  methods)  yielded  199  recessive  isolates,  of 
which  101  defined  one  complementation  group,  termed 
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Fig.  2.  Cup9p  is  a  repressor  of  the  PTR2  gene.  (A)  Expression  of  the  S.cerevisiae  PTR2  gene  in  different  genetic  backgrounds.  Equal  amounts  of 
total  RNA  isolated  from  different  strains  were  analyzed  by  Northern  hybridization  (see  Materials  and  methods),  using  the  PTR2  (peptide  transporter) 
and  ACT]  (actin)  genes  as  ^-P-labeled  probes.  Lane  a,  JD52  {CUP9  UBRI)  (wild-type)  transformed  with  pCB201  (empty  vector).  Lane  b,  JD55 
{CUP9  ubrIA)  transformed  with  pCB201.  Lane  c,  JD52  (CUP9  UBRI)  transformed  with  the  high-copy  plasmid  pCB209  that  expressed  CVP9  from 
its  natural  promoter.  Lane  d,  CBY18  {cup9A  UBRJ).  Lane  e,  CBY16  {cup9A  ubrlA).  Lane  f,  CBY21  {CUP9  UBRI  ptr2A).  Lanes  g-i,  same  as  lanes 
a-c,  but  a  longer  autoradiographic  exposure  to  highlight  the  tight  repression  of  PTR2  by  Cup9p  in  ttbrlA  cells  (b,  h)  and  the  difference  between 
levels  of  PTR2  mRNA  in  the  wild-type  cells  (a,  g)  and  their  coumerpans  that  overexpressed  Cup9p  (c.  i).  (B)  Cup9p  specifically  binds  to  a  site  in 
the  PpTR2  promoter.  A  gel  shift  assay  with  Cup9-H6.  poIy-dl  dC  and  ^^P-labeled  DNA  fragments  of  the  PTR2  promoter  hybridization  (see  Materials 
and  methods).  Lanes  a-c,  with  a  fragment  (-1  to  -447)  proximal  to  the  inferred  sian  codon  of  the  PTR2  ORE.  Lanes  d-f,  same  as  lanes  a-c.  but 
with  a  more  distal  DNA  fragment  (-448  to  -897).  Concentrations  of  Cup9-H6  (in  |ig/ml)  are  indicated  above  the  lanes. 


sub]  (suppressor  of  a  block  to  peptide  import  in  ubrlA). 
In  agreement  with  the  model’s  prediction,  sub]  mutants 
acquired  the  ability  to  express  PTR2  in  the  absence  of 
UBR]  (Figure  2A,  lane  b  versus  lane  e).  The  sub] 
locus  was  cloned  by  complementation  (see  Materials  and 
methods),  and  was  found  to  be  the  CUP9  gene. 

CUP9  was  originally  identified  by  Knight  et  al  (1994) 
as  a  gene  whose  disruption  impairs  the  copper  resistance 
of  S.cerevisiae  growing  on  lactate,  a  non-fermentable 
carbon  source.  Under  these  conditions,  Cup9p  plays  a 
major  (but  mechanistically  obscure)  role  in  copper 
homeostasis  (Knight  et  al,  1994).  CUP9  encodes  a  35  kDa 
protein  that  contains  a  homeodomain,  an  --bO-residue 
helix-tum-helix  DNA-binding  motif  present  in  many 
eukaryotic  regulatory  proteins  (Wolberger,  1996).  Outside 
the  homeodomain  region,  the  sequence  of  Cup9p  is  not 
similar  to  sequences  in  databases. 

To  verify  that  CUP9  and  SUB]  were  the  same  gene, 
complementation  tests  were  carried  out.  Two  independ’ 
ently  derived  ubr]A  cup9::LEU2  strains  (CBY16  and 
CBY17)  were  crossed  to  ubr]A  sub]-]  (CBY15),  and  the 
resultmg  diploids  (CBY23  and  CBY24,  respectively)  were 
tested  for  their  ability  to  import  dipeptides  (see  Materials 
and  methods).  As  would  be  expected  of  allelic  loci, 
cup9::LEU2  and  sub]-]  failed  to  complement  one  another: 
both  diploid  strains  remained  import-competent  (data 
not  shown).  In  another  test,  CBY23  and  CBY24  were 
sporulated,  and  the  segregants  were  analyzed  for  the 
presence  of  the  LEU2  gene  (integrated  at  the  CUP9  locus) 
and  for  the  ability  to  import  peptides.  Among  the  eight 
tetrads  tested,  LEU2  was  present  in  two  of  the  four 
segregants,  whereas  all  four  segregants  were  invariably 
import-competent,  a  pattern  expected  if  CUP9  and  SUB] 
were  one  and  the  same  gene. 


To  examine  the  regulation  of  peptide  import  by  Cup9p 
and  Ubrlp,  congenic  S.cerevisiae  strains  that  lacked, 
expressed  or  overexpressed  Cup9p  and/or  Ubrlp  were  con¬ 
structed  and  assayed  for  peptide  import  by  growth  on  select¬ 
ive  media.  Suspensions  of  cells  (auxotrophic  for  lysine) 
were  serially  diluted  and  plated  on  either  rich  media,  min¬ 
imal  media  lacking  lysine  and  containing  Lys-Ala  dipeptide 
(selecting  for  peptide  import),  or  minimal  media  containing 
the  toxic  dipeptide  Leu-Eth  (selecting  against  peptide 
import).  All  strains  grew  at  comparable  rates  on  rich  media 
(Figure  3A).  On  minimal  media  that  supplied  the  essential 
lysine  as  the  Lys-Ala  dipeptide,  the  ClfP9  UBR]  (wild- 
type),  cup9A  UBR]  and  cup9A  ubr]A  strains  grew  at  com¬ 
parable  rates  (Figure  3B),  whereas  the  CUP9  ubr]A  strain 
failed  to  grow  (Figure  3B),  in  agreement  with  the  observa¬ 
tion  that  UBR]  is  required  for  peptide  import  (Figure  1). 
Opposite  growth  patterns  were  observed  on  media  con¬ 
taining  toxic  dipeptide  (Figure  3C).  Comparison  of  the  data 
in  Figure  3  with  PTR2  mRNA  levels  (Figure  2A)  suggested 
that  CUP9  exerts  its  effect  on  the  import  of  peptides  by 
repressing  transcription  of  PTR2.  For  example,  wild-type 
{CUP9  UBR])  cells  overexpressing  Cup9p  exhibited 
reduced  levels  of  PTR2  mRNA  (Figure  2A)  and  decreased 
sensitivity  to  toxic  dipeptide  (Figure  3C),  whereas  strains 
lacking  CUP9  (cwp9A  UBR]  and  cup9A  ubr]A)  overex¬ 
pressed  PTR2  (Figure  2A)  and  were  hypersensitive  to  toxic 
dipeptide  (Figure  3C). 

Cup9p  IS  a  repressor  of  the  PTR2  gene 

To  address  the  mechanism  of  repression  by  Cup9p,  we 
asked  whether  purified  Cup9p  (see  Materials  and  methods) 
could  selectively  bind  to  specific  regions  of  the  PTR2 
promoter.  Gel  shift  assays  in  the  presence  of  poly-dl  dC 
competitor  DNA  were  performed  with  labeled  DNA 
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Fig.  3.  Relative  capacity  for  peptide  import  in  congenic  S.cerevisiae 
strains  that  contained,  lacked  or  overexpressed  Cup9p  and/or  Ubrlp. 
Serial  dilutions  of  the  indicated  strains  were  deposited  by  a  48-pin 
applicator  onto  either  rich  (YPD)  medium  (A),  minimal  medium 
containing  66  pM  Lys-Ala  dipeptide  as  the  sole  source  of  lysine  (B), 
or  minimal  medium  containing  both  lysine  (at  110  pM)  and  the  toxic 
dipeptide  Leu-Eth  (at  55  pM)  (C).  The  plates  were  incubated  at 
for  1-2  days.  The  relevant  genetic  loci  are  shown  on  the  left.  The 
'‘UBRJy  overexpressed  CUP9'  strain  (JD52-2p-CUP9)  carried  the  high- 
copy  plasmid  pCB209  that  expressed  Cup9p  from  its  natural  promoter. 


fragments  of  the  PTR2  promoter  and  Hise-tagged  Cup9p 
(Cup9p-H6)  purified  from  E.coli.  Cup9p-H6  bound  to  a 
site  within  a  distal  region  of  the  PTR2  promoter  (positions 
-448  to  -897  relative  to  the  inferred  CUP9  start  codon), 
but  did  not  bind  to  the  proximal  region  of  the  PTR2 
promoter  (positions  —1  to  —447)  under  the  same  condi¬ 
tions  (Figure  2B). 

The  transcriptional  repressor  function  of  Cup9p  was 
further  suggested  by  the  finding  that  the  co-repressor 
complex  Tuplp/Ssn6p  plays  a  role  in  the  control  of  peptide 
import.  The  Tuplp/Ssn6p  complex  inhibits  transcription 
of  many  yeast  genes  through  interactions  with  gene- 
specific  DNA-binding  repressors  such  as  Mata2p  (Chen 
et  al,  1993;  Smith  et  al,  1995),  a  homeodomain  homolog 
of  Cup9p.  We  found  that  most  of  our  sub  mutants  that 
were  not  CUP9  mutants  could  be  complemented  by  low- 
copy  plasmids  bearing  SSN6  or  TUPl  (G.Tumer,  S.Saha 
and  A.Varshavsky,  unpublished  data).  In  addition,  deletion 
of  SSN6,  like  deletion  of  CUP9,  restored  the  ability  of 
ubrlA  cells  to  import  peptides  (data  not  shown). 

Ubr  Ip-dependent  degradation  of  Cup9p 

The  fact  that  deletion  of  CUP9  renders  PTR2  transcription 
independent  of  Ubrlp  (Figure  2A),  and  the  observation 
that  overexpression  of  Ubrlp  enhances  the  import  of 
peptides  in  Cup9p-expressing  strains  (data  not  shown) 
suggested  that  the  N-end  rule  pathway  regulates  peptide 


import  by  targeting  Cup9p  for  degradation.  To  test  this 
conjecture,  we  carried  out  pulse-chase  experiments  with 
a  C-terminally  FL AG-tagged  Cup9p  in  UBRJ  and  itbrlA 
cells.  Cup9p-FLAG  was  a  very  short-lived  protein  (t\p_ 
-5  min)  in  UBRl  cells  (Figure  4).  By  contrast,  Cup9p 
was  much'  longer-lived  (/1/2  >30  min)  in  iibrlA  cells 
(Figure  4).  Degradation  of  Cup9p  was  also  found  to 
depend  upon  UBC2  and  UBC4  (data  not  shown),  in 
agreement  with  the  observation  that  a  ubc2A  ubc4A  double 
mutant  failed  to  import  dipeptides  (Figure  IG).  The 
residual  instability  of  Cup9p  in  ubrIA  cells  (Figure  4) 
suggested  the  presence  of  a  second,  Ubrlp-independent 
degron;  this  pattern  is  reminiscent  of  another  homeo¬ 
domain  repressor,  Mata2p,  which  also  contains  at  least  two 
distinct  degradation  signals  (Hochstrasser  and  Varshavsky, 
1990;  Chen  et  al,  1993). 

Ubrlp  recognizes  engineered  N-end  rule  substrates 
through  their  destabilizing  N-terminal  residues 
(Varshavsky,  1996).  To  determine  the  N-terminal  residue 
of  Cup9p,  we  overexpressed  and  purified  Cup9p-FLAG 
from  ubrlA  S.cerevisiae  and  subjected  it  to  N-terminal 
sequencing.  Cup9p-FLAG  was  found  to  have  a  blocked 
(presumably  acetylated)  N-terminus  (see  Materials  and 
methods),  suggesting  that  Ubrlp  targets  Cup9p  through  a 
degron  distinct  from  the  N-degron.  Independent  evidence 
for  this  conjecture  was  produced  through  the  analysis  of 
GST-Cup9p-ha2,  a  fusion  of  glutathione  transferase  (GST) 
and  C-terminally  ha-tagged  Cup9p.  Pulse-chase  analysis 
of  GST-Cup9p-ha2  revealed  that  the  fusion  protein  was 
nearly  as  short-lived  in  vivo  as  the  N-terminally  unmodified 
Cup9p-FLAG  (Figure  4,  lanes  b-e;  compare  with  Figure 
5,  lanes  b-e).  If  Cup9p  were  targeted  for  processive 
degradation  through  a  destabilizing  N-terminal  residue,  a 
preliminary  proteolytic  cleavage(s)  of  Cup9p  would  be 
required  to  expose  such  a  residue  (Varshavsky,  1996).  In 
the  case  of  GST-Cup9p-ha2,  this  cleavage  would  generate 
a  proteolytic  fragment  consisting  of  GST  and  an  N-terminal 
portion  of  Cup9p  preceding  the  cleavage  site.  Since  free 
GST  has  been  found  to  be  long-lived  when  expressed  in 
yeast  (data  not  shown),  accumulation  of  such  a  proteolytic 
fragment  would  be  expected  to  accompany  the  degradation 
of  GST-Cup9p-ha2. 

Pulse-chase  analysis  of  GST-Cup9p-ha2,  using  gluta¬ 
thione-agarose  beads  to  isolate  GST-containing  proteins 
(see  Materials  and  methods),  indicated  that  the  degradation 
of  GST-Cup9p-ha2  by  the  N-end  rule  pathway  was  not 
accompanied  by  the  appearance  of  a  fragment  containing 
the  N-terminal  GST  moiety  (Figure  5).  This  finding 
strongly  suggested  that  Cup9p  bears  an  ‘internal’  degron 
recognized  by  Ubrlp.  Additional  support  for  this  conjec¬ 
ture  was  provided  by  truncation  analysis  of  a  Cup9p- 
DHFR-myc  fusion  protein.  These  experiments  indicated 
that  the  N-terminal  81  residues  of  the  306-residue  Cup9p 
are  dispensable  for  its  Ubrlp-dependent  degradation, 
strongly  suggesting  that  the  relevant  degradation  signal 
resides  in  the  C-terminal  two-thirds  of  Cup9p  (C.Byrd, 
I.Davydov  and  A.Varshavsky,  unpublished  data).  Although 
these  results  are  fully  consistent  with  the  presence  of  an 
internal  Ubrlp-dependent  degron  in  Cup9p,  there  remains 
the  less  parsimonious  possibility  that  Cup9p  is  degraded 
via  ?raw5-targeting  (Johnson  et  ah,  1990).  In  this  process, 
the  two  determinants  of  the  N-degron  (a  destabilizing 
N-terminal  residue  and  a  ubiquitin-accepting  internal  Lys 
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Fig.  4.  Degradation  of  Cup9p  by  the  N-end  rule  pathway.  Saccharomyces  cerevisiae  strains  carrying  pCB210,  a  low-copy  plasmid  that  expressed 
Cup9p-FLAG  from  the  Vcvpg  promoter,  were  labeled  at  30®C  with  [-^^Sjmethionine/cysteine  for  5  min.  followed  by  a  chase  for  0,  5,  15,  30  and 
60  min.  preparation  of  extracts,  immunoprecipitation  of  Cup9p-FLAG  with  a  monoclonal  anti-FLAG  antibody.  SDS-PAGE  and  autoradiography/ 
quantitation  (see  Materials  and  methods).  Lane  a,  JD55  (i/fer/A)  cells  that  carried  pCB200  (vector  alone).  Lanes  b-f,  JD52  iUBRl)  (wild-type)  cells 
that  carried  pCB210,  expressing  Cup9p-FLAG.  Lanes  g-k,  same  as  lanes  b-f,  but  with  JD55  {ubrlt^)  cells. 


residue)  reside  in  two  different  polypeptides,  whose  inter¬ 
action  yields  an  active  N-degron,  leading  to  ubiquitinyl- 
ation  and  degradation  of  the  polypeptide  bearing  the  Lys 
residue  (Johnson  et  al,  1990).  At  present,  little  is  known 
about  the  protein  ligands  of  Cup9p,  save  for  the  likely 
possibility  that  Cup9p  interacts  with  the  Tuplp/Ssn6p 
repressor  complex  (see  above),  similarly  to  the  previously 
established  interaction  of  this  complex  with  Mata2,  a 
homeodomain-containing  homolog  of  Cup9p  (Smith 
et  ai,  1995). 

Discussion 

The  discovery  that  Ubrlp  controls  the  import  of  peptides 
through  degradation  of  the  Cup9p  repressor  (Figure  6) 
identifies  the  first  clear  physiological  function  of  the  N-end 
rule  pathway.  The  existence  of  mammalian,  plant  and 
bacterial  homologs  of  the  yeast  Ptr2p  transporter  suggests 
that  the  import  of  peptides  in  these  organisms  may  also 
be  regulated  by  the  N-end  rule,  pathway. 

Why  was  this  pathway,  rather  than  another  Ub-depend- 
ent  proteolytic  system,  recruited  in  the  course  of  evolution 
for  the  regulation  of  peptide  import?  A  plausible  explan¬ 
ation  is  suggested  by  the  ability  of  Ubrlp  to  bind  peptides 
bearing  destabilizing  N-terminal  residues  (Varshavsky, 
1996).  Since  more  than  half  of  the  20  amino  acids  are 
destabilizing  in  the  yeast  and  mammalian  N-end  rules 
(Varshavsky,  1996),  a  significant  fraction  of  imported 
peptides  would  be  expected  to  compete  with  Cup9p  for 
binding  to  Ubrlp.  This  competition  may  decrease  the  rate 
of  Cup9p  degradation.  [Dipeptides  added  to  a  culture  of 
S.  cerevisiae  have  been  shown  to  inhibit  the  N-end  rule 
pathway  (Baker  and  Varshavsky,  1991).]  The  ensuing 
increase  in  the  level  of  Cup9p  repressor  would  in  turn 
decrease  the  production  of  the  Ptr2p  transporter,  diminish¬ 
ing  the  rate  of  peptide  import.  This  ‘peptide-sensing’ 
negative  feedback  loop  would  maintain  the  intracellular 
concentration  of  short  peptides  within  a  predetermined 
range — ^potentially  a  useful  feature,  made  possible  by  the 
substrate-binding  properties  of  Ubrlp.  Experiments  to 
verify  this  model  are  under  way. 

Another  interesting  aspect  of  the  Ubrlp-Cup9p-Ptr2p 
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Fig.  5.  Degradation  of  GST-Cup9p-ha2  is  not  preceded  by  a  processing 
that  releases  a  GST-containing  fragment.  Saccharomyces  cerevisiae 
strains  carrying  pCB120,  a  low-copy  plasmid  that  expressed  GST- 
Cup9p-ha2  from  the  Pgali  promoter,  were  labeled  at  30°C  with 
F^Sjmethionine/cysteine  for  5  min,  followed  by  a  chase  for  0,  5,  10 
and  30  min,  preparation  of  extracts,  isolation  of  GST-Cup9p-ha2  on 
glutathione-agarose  beads,  SDS-PAGE  and  autoradiography/ 
quantitation  (see  Materials  and  methods).  Lanes  b-e,  JD52  (UBRJ) 
(wild-type)  cells  expressing  GST-Cup9p-ha2.  Lanes  f-i,  same  as  lanes 
b-e,  but  with  JD55  (ubrlA)  cells.  Lane  a,  molecular  mass  markers 
(in  kDa). 

regulatory  circuit  is  its  possible  relevance  to  copper 
homeostasis.  In  addition  to  functioning  as  a  repressor  of 
peptide  transport,  Cup9p  also  contributes  to  the  resistance 
of  S,  cerevisiae  to  copper  toxicity  during  growth  on  lactate, 
a  non-fermentable  carbon  source  (Knight  et  al,  1994), 
Copper  homeostasis  is  mediated  by  a  complex  set  of 
circuits,  some  of  which  also  regulate  iron  metabolism 
(Zhou  and  Thiele,  1993;  Ooi  et  al.,  1996).  The  double 
role  of  Cup9p  in  regulating  both  peptide  import  and  copper 
homeostasis  may  signify  a  physiological  connection 
between  these  seemingly  unrelated  processes. 
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Fig.  6.  A  model  for  regulation  of  peptide  import  in  S.cerevisiae.  The 
expression  of  PTR2.  which  encodes  a  transmembrane  peptide 
transponer.  is  regulated  by  the  short-lived,  homeodomain-containing 
transcriptional  repressor  Cup9p.  The  concentration  of  Cup9p  is 
controlled  in  part  through  its  degradation  by  the  N-end  rule  pathway, 
whose  targeting  components  include  Ubrlp  and  either  Ubc2p  or 
Ubc4p  {see  the  main  text). 

Our  work  adds  Cup9p  to  the  list  of  short-lived  regulatory 
proteins  whose  degradation  is  mediated  by  the  ubiquitin 
system.  Many  of  these  proteins  are  negative  regulators.  For 
example,  Mata2p,  a  homeodomain-containing  homolog  of 
Cup9p,  controls  the  mating  type  of  S.cerevisiae  (a  or  a) 
through  repression  of  a-specific  genes  (Herskowitz,  1989). 
Mata2p  appears  to  be  constitutively  short-lived  in  haploid 
cells  (Hochstrasser  and  Varshavsky,  1990).  Therefore, 
cessation  of  Mata2p  synthesis  upon  the  conversion  of  an 
a  cell  into  an  a  cell  results  in  rapid  disappearance 
of  Mata2p  and  the  establishment  of  a-specific  circuits 
(Hochstrassser,  1996).  Progression  of  the  cell  cycle  is  also 
controlled  by  short-lived  negative  regulators,  in  particular 
by  Sic  Ip,  an  inhibitor  of  CDK,  the  cyclin-dependent 
kinase  (Schwob  et  al,  1994).  In  this  case,  however,  a 
rapid  ‘on-switch’  is  based  on  the  phosphorylation-induced 
degradation  of  the  previously  stable  Sic  Ip  (King  et  al, 
1996).  Whether  the  Cup9p  repressor  (Figure  6)  is  constitu¬ 
tively  short-lived  or  whether  its  stability  is  regulated  by 
external  conditions  such  as,  for  example,  different  nitrogen 
sources,  remains  to  be  determined. 

The  finding  that  Cup9p  lacks  a  destabilizing  N-terminal 
residue  indicates  that  Ubrlp,  the  recognition  component 
of  the  N-end  rule  pathway,  is  able  to  target  substrates 
bearing  either  internal  degradation  signals  or  N-degrons. 
The  Ga  subunit  of  the  G  protein — the  other  known 
physiological  substrate  of  Ubrlp  in  S.cerevisiae — also 
lacks  a  destabilizing  N-terminal  residue  (Madura  and 
Varshavsky,  1994).  Thus,  a  substantial  fraction  of  naturally 
short-lived  proteins  targeted  by  the  N-end  rule  pathway 
may  bear  internal  degrons  rather  than  N-degrons,  a  pos¬ 
sibility  that  the  identification  of  other  physiological  sub¬ 
strates  of  this  pathway  will  address. 

Materials  and  methods 

Strains,  media  and  genetic  techniques 

Saccharomyces  cerevisiae  strains  were  grown  in  rich  (YPD)  medium 
containing  2%  peptone,  1%  yeast  extract  and  2%  glucose  or  in  synthetic 
yeast  media,  containing  0.67%  yeast  nitrogen  base  without  amino  acids 
(Difco),  auxotrophic  nutrients  at  concentrations  specified  by  Sherman 
et  al  (1986)  and  either  2%  glucose  (SD  medium),  2%  raffinose  (SM- 
raffinose  medium)  or  2%  galactose  (SM-galactose  medium)  as  carbon 
sources.  SHM-glucose  medium  used  in  halo  assays  was  prepared 
according  to  Island  et  al.  (1987)  and  was  identical  to  SD  medium  except 
that  it  lacked  methionine  and  contained  allantoin  (1  mg/ml)  and  yeast 
nitrogen  base  (Difco,  1.7  mg/ml)  without  amino  acids  and  ammonium 


sulfate.  Escherichia  coU  strain  DH5a  was  used  for  plasmid  propagation 
and  cloning  steps.  For  induction  of  the  promoter,  cells  were  grown 
to  an  A^oo'of  0.5-1  in  SM-raffinose  medium,  pelleted  and  transferred 
to  SM-galactose  medium  for  3  h. 

Halo  and  dilution  assays 

Peptide  import  was  assayed  using  the  halo  and  dilution  methods.  For 
halo  assays  (Island  et  al,  1987),  cells  were  grown  to  an  ^^(X)  of  --I  in 
SHM-glucose  medium  with  auxotrophic  supplements.  Cells  were  pelleted 
by  centrifugation  and  resuspended  in  water  to  ~5X10^  cells/ml.  Then 
0.1  ml  of  cell  suspension  was  mixed  with  10  ml  of  0.8%  noble  agar 
(Difco)  at  55*C,  and  spread  on  plates  containing  20  ml  of  SHM  medium. 
Sterile  filter  disks  containing  the  toxic  dipeptide  L-leucyl-L-ethionine 
(Leu-Eth:  5  pmol)  were  placed  in  the  middle  of  each  plate,  followed  by 
incubation  at  30®C  for  1-2  days. 

For  dilution  assays,  strains  were  grown  (under  selection  for  plasmids) 
in  SD  medium  to  an  A^oo  of  ~1.  Cells  from  each  sample  (1.5X  10^)  were 
spun  down  and  resuspended  in  1  ml  of  water.  The  samples  were  serially 
diluted  by  4-fold  in  a  microtiter  plate  (150  pl/well)  to  generate  eight 
different  initial  concentrations  of  cells  that  ranged  from  1.5X10^  to  915 
cells/ml.  The  suspensions  were  spotted  to  various  media,  using  a  48-pin 
applicator.  The  plates  were  incubated  at  30®C  for  1-2  days. 

Leu-Eth  was  synthesized  using  standard  methods  of  organic  chemistry. 
Briefly,  L-ethionine  methyl  ester  (Eth-OMe)  was  produced  from 
L-ethionine  and  methanol.  Eth-OMe  was  then  coupled  with  N-tert- 
butoxy-carbonyl-L-leucine-/>-nitrophenyl  ester  (N-/-BOC-L-leucine- 
PNP),  yielding  N-/-BOC-L-leucyl-L-ethionine  methyl  ester,  which  was 
purified  by  flash  chromatography  on  a  silica  column.  N-r-BOC-L-leucyl- 
L-ethionine  methyl  ester  was  then  converted  into  N-/-BOC-L-leucyl-L- 
ethionine  by  treatment  with  KOH.  N-/-BOC-L-leucyl-L-ethionine  was 
deprotected  with  trifluoroacetic  acid,  yielding  Leu-Eth. 

4  screen  for  import^competent  mutants  in  the  ubrIA 
background 

Thirty  5-ml  cultures  of  JD83-1 A  (leu2-3  ubrIA),  auxotrophic  for  leucine, 
were  grown  to  an  A^oo  of  ~1,  and  -1.5X10^  cells  from  each  culture 
were  plated  onto  SD  medium  lacking  leucine  and  histidine  but  containing 
the  leucyl-histidine  dipeptide  at  0.23  pM.  Plates  were  incubated  at  30°C 
for  2  days,  selecting  for  mutants  able  to  grow  on  this  medium.  Of  the 
320  sub  mutants  (‘suppressors  of  a  block  to  peptide  import  in  uhrlA  ) 
thus  obtained,  199  were  found  by  complementation  tests  to  be  clearly 
recessive.  Among  these,  101  mutants  belonged  to  one  complementation 
group,  termed  subl. 

Isolation  of  the  CUPS  (SUB1)  gene 

A  50  ml  culture  of  S.cerevisiae  CBY15  (ubrIA  subl -I)  was  grown  to 
an  Afioo  of  -1  in  SHM-glucose.  Cells  were  made  competent  with  lithium 
acetate  (Ausubel  et  al,  1992),  and  25  0.1 -ml  samples  (~7X10^  cells/ 
ml)  were  transformed  with  a  yeast  genomic  DNA  library  (American 
Type  Culture  Collection;  #77164)  in  the  TRPJ,  CENd-bascd  vector 
pRS200  (Sikorski  and  Hieter,  1989).  The  cells  were  pelleted,  and  each 
sample  was  resuspended  in  1.5  ml  of  SHM-glucose  and  incubated  at 
30°C  for  3  h.  The  cells  were  pelleted  again,  each  sample  was  resuspended 
in  0.1  ml  of  water,  added  to  10  ml  of  0.8%  Noble  agar  at  55°C,  and 
spread  onto  SHM-glucose  plates  lacking  Trp  and  containing  Leu-Eth  at 
37  pM.  Of  the  -2.5X10^  Trp^  transformants  plated,  14  could  grow  in 
the  presence  of  Leu-Eth.  Of  these,  two  yielded  the  library-derived 
plasmids,  pSUBl-1  and  pSUBl-6,  that  complemented  the  subl  mutation 
of  CBY15.  The  insert  of  pSUBl-6  contained  the  insert  of  pSUBl-1. 
Partial  sequencing  of  the  -8.2  kb  insert  of  pSUBl-1  identified  it  as  a 
region  of  the  S.cerevisiae  chromosome  XVI.  Deletion  analysis  (not 
shown)  localized  the  complementing  activity  to  an  -2.8  kb  HindlU- 
Kpnl  fragment,  whose  sequence  revealed  the  presence  of  two  ORFs 
(CUP9  and  SCYPL178W).  Further  deletion  analysis  (not  shown)  local¬ 
ized  the  complementing  activity  to  the  previously  isolated  (Knight  et  al, 
1994)  CUP9  gene.  Verification  that  CUP9  and  SUBl  were  one  and  there 
same  gene  was  carried  out  as  described  in  Results. 

Construction  of  null  cupS  mutants 

The  -2.8  kb  HindUl-Kpnl  genomic  DNA  fragment  containing  CUP9 
was  ligated  to  //wdlll-Xp/iI-cut  pRS306ASpel,  yielding  pCB117. 
pRS306ASpel  was  derived  from  the  f/RA3-bearing  pRS306  (Sikorski 
and  Hieter,  1989)  through  elimination  of  its  Spel  site.  The  CUP9  ORF 
was  disrupted  by  inserting  an  -2  kb,  blunt-ended,  LEf/2-cbntaining  Sad 
fragment  from  pJJ283  into  the  Spel  site  of  CUP9,  yielding  pCB119,  in 
which  the  LEU2  and  CUP9  ORFs  were  oriented  in  opposite  directions. 
An  -5  kb  Hindlll-Pvull  fragment  of  pCB119  containing  the  cup9::LEU2 
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Table  1.  Saccharomyces  cerevisiae  strains  used  in  this  study 

p 

Strain  Genotype  References 


DF5 

EJI 

EJ2 

EJ3 

EJIO 

eY102 

EJY105 

EJY106 

MHY495 

MHY503 

MHY507 

MHY550 

MHY552 

MHY554 

MHY557 

MHY570 

MHY599 

MHY601 

YW055 

YPH500 

BBY67 

KM207-1 

JD34 

JD5I 

JD52 

JD53 

JD55 

JD83-IA 

RJD549 

RJD795 

SGY6 

CBY4 

CBY5 

CBY6 

CBY7 

CBY8 

CBY9 

CBYIO 

CBYll 

CBYI5 

CBY16 

CBY17 

CBYI9 

CBY23 

CBY24 


MAT^MATa  trpI-l/trpJ-1  uraB-52/ura3-52  his3-A200/liisB-A200  leu2>3J}2Aeu2’3J12  lys2-801/ 
lys2’80]  gal/gal 

MATa  ubc5A::LEU2  trpl-l  ura3^52  his3-^A200  Ieu2-3J12  l\s2-801  gal 

MATa  libel A::VRA3  trpI-1  ura3-52  his3-A200  Ieu2-3J12  i\s2-801  gal 

MATa  ubc4A::HIS3  trpJ-1  ura3-52  his3-A200  Ieii2-3J12  Iys2-80J  gal 

MATa  ubc4A::HIS3  ubc5A::LEU2  trpI-J  ura3’52  lus3-A2do  Ieii2-3J12  l\s2’80I  gal 

MATa  ubc2A::LEV2  trpTl  ura3-52  his3-A200  Ieu2-3J12  lys2‘80]  gal 

MATa  ubc2A::LEU2  ubc4A::HIS3  trpl-1  ura3-52  his3-A20b  Ieii2-3J12  Iys2-80I  gal 

MATa  ubc2A::LEU2  ubc4A::HlS3  trphi  ura3-52  his3-A200  Ieu2-3J  12  l\s2-80l  gal 

MATa  ubc6-AI::HIS3  trpTl  ura3-52  his3^A200  Ieii2-3J12  l\s2-801 

MATa  ubc4-Al::HIS3  ubc6-Al::HIS3  trpTl  ura3‘52  his3-A200  leu2^3J12  lxs2-801 

MATa  ubc7::LEU2  trpl-1  ura3-52  his3-A200  Ieii2-3J12  lys2-801 

MATa  ubc4-A2::TRPl  ubc5-Al::LEU2  ubc6-Al ::H1S3  trpl-1  ura3-52  his3-A200  leu2-3.112 

lys2-801 

MATa  ubc6-Al::HlS3  ubc7::LEU2  trpl-1  ura3-52  his3-A200  l€u2-3J12  lys2-801 

MATa  ubc4-Al::HlS3  ubc7::LEU2  trpl-1  ura3-52  his3-A200  leu2-3,112  lys2-801 

MATa  ubc4-Al::HIS3  ubc6-Al::HIS3  ubc7::LEU2  trpM  ura3-52  his3-A200  Ieu2-3JI2  Iys2-801 

MATa  ubc4-A2::TRPl  ubc5-Al::LEU2  ubc6-Al::HlS3  ubc7::LEU2  trpl-1  ura3-52  Im3-A200 

Ieu2-3J12  iys2-801 

MATa  pas2  trp  ura3-52  adel  leu2-3 

MATa  ubc8::URA3  trpl-1  ura3-l  ade2-l  his3-ll  Ieu2-3J12  canl-100 

MATa  ubc9-l  trpl-1  ura3-52  bis3-A200  Ieu2-3J12  lys2-801 

MATa  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-Al  l\s2-801 

MATa  ubc2A::LEU2  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-Al  lys2-801 

MATa  atelA::TRPl  trpl-A63  ura3-52  ade2-101  his3-A200  Ieii2-Al  lys2-801 

MATa  ubc2A::URA3  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-Al  lys2-801 

MATalMATa  trpl-A63/trpl-A63  ura3-52/ura3-52  his3-A200/his3-A2do  Ieii2-3J12/Ieu2-3J12 

Iys2-801/lys2-801 

MATa  trpl-A63  ura3-52  kis3-A200  Ieu2-3J12  lys2-801 

MATa  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  )ys2-801 

MATa  ubrlA::HlS3  trpl-A63  ura3-52  his3-A20d  Ieu2-3J12  l\s2-801 

MATa  ubrlA::HIS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  )ys2-801 

MATa  cdc34-l  trpl  ura3-52  leu2-3 

MATa  cdc34-2  trpl  ura3-52  leii2-3 

MATa  ntalAr.TRPl  trpl-A63  ura3-52  ade2-101  his3-A200  Ieii2-3J12  iys2-801 
MATa  ubc2A::URA3  ubc5A::LEU2  trpl-1  ura3-52  his3-A200  Ieii2-3J12  lys2-801  gal 
MATa  ubc2A::LEU2  ubc6-Al::HlS3  trpl-1  ura3-52  his3-A200  leu2-3,112  lys2-801 
MATa  iibc2A::URA3  ubc7::LEU2  trpl-1  ura3-52  his3-A200  Ieu2-3J12  l\s2-801 
MATa  ubc2A::LEU2  ubc8::URA3  trpl-1  ura3-l  ade2-l  his3-ll  Ieu2-3J12  canl-100 
MATa  ubc2A::LEU2  ubclA::URA3  trpl-1  ura3-52  his3-A200  Ieu2-3J12  l\s2-801  gal 
MATa  ubc2A::LEU2  ubc9-l  trpl-1  ura3-52  his3-A200  Ieu2-3J12  lys2-80) 

MATa  ubc2A: :LEU2  pas2  trp  ura3-52  adel  leu2-3 
MATa  ubc2A::LEU2  cdc34-l  trpl  ura3-52  l€u2-3 

MATa  subl-1  ubrlA::HIS3  trpl-A63  ura3-52  his3-A200  leu2-3,112  lys2-801 
MATa  cup9::LEU2  ubrlA::HlS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lvs2-801 
MATa  cup9::LEU2  ubrlA::HIS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  iys2-801 
MATa  cup9::LEU2  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATalMATa  subl-J/cup9A::LEU2  ubrlA::HlS3/ubrlA::HlS3  trpl-A63/trpl-A63  ura3-52/ura3-52 
his3-A200/his3-A200  leu2-3J12Aeu2-3,112  Iys2-801/lys2-801 

MATalMATa  subl-l/cup9A::LEU2  ubrlA::HlS3/ubrlA::HlS3  trpl-A63/trpl-A63  ura3-52/ura3-52 
his3-A200/his3-A200  Ieu2-3J12/Ieu2-3J12  Iys2-801/lys2-801 


Finley  et  al.  (1987) 

Derivative  of  DF5^ 

Derivative  of  DF5^ 

Derivative  of  DF5“ 

Derivative  of  DF5“ 

Derivative  of  DF5“ 

Derivative  of  PF5“ 

Derivative  of  DF5^ 

Derivative  of  DF5^ 

Derivative  of  DF5*’ 

Derivative  of  DFS*’ 

Derivative  of  DF5^ 

Derivative  of  DF5^ 

Derivative  of  DF5^ 

Derivative  of  DF5^ 

Derivative  of  DF5^ 

Chen  et  aL  (1993) 

Chen  et  al.  (1993) 

Derivative  of  DFS^^ 

Sikorski  and  Hieter  (1989) 
Derivative  of  YPH500^ 
Derivative  of  YPH5(X)^ 
Derivative  of  YPH500^ 

Dohmen  et  al.  (1995) 

Johnson  et  al.  (1995) 

Dohmen  et  al.  (1995) 

Madura  and  Varshavsky  (1994) 
Derivative  of  JD5l‘^ 

R.Deshaies^ 

R. Deshaies^ 

S.  Grigoryev*^ 

Derivative  of  EJ 1 
Derivative  of  MHY495 
Derivative  of  MHY507 
Derivative  of  MHY601 
Derivative  of  EJ2 
Derivative  of  YW055 
Derivative  of  MHY599 
Derivative  of  RJD549 
Derivative  of  JD83-1A 
Derivative  of  JD55 
Derivative  of  JD55 
Derivative  of  JD52 
Produced  by  mating  CBY15  and 
CBY16 

Produced  bv  mating  CBY15  and 
CBYI7 


“Johnson  et  al.  (1992,  1995).  A  gift  from  EJohnson,  the  Rockefeller  University,  New  York,  NY  10021-6399,  USA. 

^l^n  et  al.  (1993).  A  gift  from  M.Hochstrasser,  Department  of  Biochemistry  and  Molecular  Biology,  University  of  Chicago,  Chicago,  IL  60637, 

‘^A  gift  from  S.Jentsch,  ZMBH,  Universitat  Heidelberg,  69120  Heidelberg,  Germany. 

‘^Dohmen  et  al  (1990). 

®A  gift  from  K.Madura,  Department  of  Biochemistry,  UMDNJ-Johnson  Medical  School,  Piscataway,  NJ  08854,  USA. 

*^A  gift  from  J.Dohmen,  Heinrich-Heine-Universitat,  Institut  fur  Mikrobiologie,  40225  Dusseldorf,  Germany. 

8A  gift  from  R.Deshaies,  Division  of  Biology,  Caltech,  Pasadena,  CA  91125,  USA. 

‘‘A  gift  from  S. Grigoryev,  Department  of  Biology,  University  of  Massachusetts,  Amherst,  MA  01003,  USA. 


disruption  allele  was  used  to  replace  the  wild-type  CUP9  alleles  of  JD52 
(wild-type)  and  JD55  (ubrJA)  by  homologous  recombination  (Rothstein, 
1991),  generating  strains  CBY19  and  CBY17,  respectively. 

CUPS-expressing  plasmids 

The  plasmid  pCB116,  which  expressed  Cup9p-FLAG  from  the  Pgali 
promoter,  was  constructed  by  subcloning  an  ~1  kb  BamUl-EcoRl 
fragment  containing  the  CUP9-FLAG  ORF  into  the  BamY{\-EcoR\  site(s) 
of  p416GALl  (Mumberg  et  al,  1994).  The  CUP9-FLAG-con\ammg 
fragment  was  constructed  by  PCR  amplification  of  the  CUP9  ORF  of 


plasmid  pCBlll  using  primers  PCBl  (5'-CGCGGATCCGAATAGT- 
TACATTCGAAGATG-3')  and  PCB6  (5'-CCGGAATTCTCATTTA- 
TCATCATCGTCTTTGTAATCATTCATATCAGGGTTGGATAG-3'), 

resulting  in  the  addition  of  the  8-residue  FLAG  epitope,  DYKDDDDK, 
to  the  C-terminus  of  Cup9p.  pCB  1 1 1  was  constructed  by  subcloning  the 
~2.8  kb  HM\\\-Kpn\  fragment  of  the  CW’P-containing  pSUBl-1  (see 
above)  into  ////idlll-^Tp/iI-cut  pRS316  (Sikorski  and  Hieter,  1989). 
pCB202  was  constructed  by  subcloning  an  ~1  kb  fragment  containing 
^CUP9  promoter  into  HindlW-BamWl-cvX  pCB201.  [The  ^-1  kb 
fragment  was  produced  by  PCR  from  pCBlll  using  primers  PCB8 
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(5'-GTGTTAGTAAGCTTGTAAAGGAATGCACGTATT-3')  and  PCB9 
(5'-<;:CCGCGGATCCGCATGCAACTATTCTCGAAGGTTGT-3').] 
pCB200  (ARS-CEN,  LEU2)  and  pCB20l  (2p,  LEU2)  were  constructed 
by  replacing  the  517  bp  5cY/I-£cr)RI  fragment  of  pBR322  (Ausubel 
et  al,  1992)  with,  respectively,  the  3822  bp  fragment  of 

pRS415  (Sikorski  and  Hieter,  1989)  and  the  4650  bp  Sca\-Nae\  fragment 
of  pRS425  (Christianson  et  ai.  1992). 

The  plasmid  pCB209  (2p,  LEU2),  which  expressed  CUP9  from  the 
Pcupg  promoter,  was  constructed  by  replacing  the  Sphl-SaR  fragment 
of  pCB202  with  an  kb  fragment  containing  the  CUP9  ORF  that 
was  produced  by  PCR  from  pCBlll.  using  primers  PCBIO  (5'-CCC- 
GCGGATCCGCATGCGAAGATGAATTATAACTGC-3')  and  PCB12 
(5'-CCCGCGCGGTCGACCTCAATTCATATCAGGGrrGGATAG-3'). 
pCB210  (ARS-CEN,  LEU2)  that  expressed  Cup9p-FLAG  from  the  PcuP9 
promoter  was  constructed  by  replacing  the  Sphl-SaR  fragment  of  pCB202 
with  an  -rl  kb  fragment  containing  the  CUP9-FLAG  ORF,  which  was 
produced  from  pCBlll  using  primers  PCBIO  (see  above)  and  PCBI3 
(5'-CCCGCGCGGTCGACCTCATTTATCATCATCGTCTTTGTA- 
ATCATTCATATCAGGGTTGGATAG-3'),  yielding  pCB211.  The  ~2  kb 
///ndIU-5^7/I  fragment  of  pCB211  containing  PcuP9  ^he  CUP9- 
FLAG  ORF  was  subcloned  into  pCB200,  yielding  pCB210. 

Plasmid  pCB  120,  expressing  GST-Cup9p-ha2  from  the  Pgali  promoter, 
was  constructed  by  subcloning  the  -1.6  kb  Xbal-EcoKl  fragment, 
containing  the  GST-CVP9-ha2  ORF,  •  into  the  Xbal-EcoRA  site(s)  of 
p416GALl.  The  Xbal-EcoRl  fragment  was  produced  by  PCR  amplifica¬ 
tion  of  the  GST-CUP9-ha^  ORF  of  pGEX-2T-CUP9-ha't,  using  the 
primers  PCB3  (5'-CCGGAATTCTCAAGCGTAATCTGGAACATC- 
GTATGGGTAAGCGTAATCTGGAACATCGTATGGGTAATTCATA- 
TCAGGGTTGGATAG-3')  and  PCB5  (5'-TGCTCTAGAACAGT- 
ATTCATGTCCCCTATA-3').  pGEX-2T-CUP9-ha2  was  constructed  by 
subcloning  an  -1  kb  fragment  containing  the  CUP9-ha2  ORF  into  the 
BaniHl-EcoRl  site(s)  of  pGEX-2T  (Pharmacia),  resulting  in  an  in-frame 
fusion  of  the  sequence  encoding  26  kDa  glutathione  5-transferase  (GST) 
domain  of  Saccharomyces  japonicum  (Smith  and  Johnson,  1988)  to  the 
second  codon  of  CUP9.  The  Ci/FP-Zra^-containing  fragment  was  pro¬ 
duced  by  PCR  amplification  of  the  CUP9  ORF  of  pCBlll  using  the 
primers  PCB3  (see  above)  and  PCB4  (5'-CGCGGATCCAATTAT- 
AACTGCGAAATACAAAAC-3').  This  step  added  to  the  C-terminus  of 
Cup9p  a  sequence  encoding  a  tandem  repeat  of  the  9-residue  sequence 
YPYDVPDYA,  derived  from  hemagglutinin  (ha)  of  influenza  virus. 

Northern  hybridization 

RNA  was  isolated  from  S.cerevisiae  as  described  (Schmitt  et  al,  1990). 
Electrophoresis  of  the  RNA  samples  was  carried  out  on  a  formaldehyde 
RNA  gel  (Ausubel  et  al,  1992).  An  -50-|ig  RNA  sample  was  loaded 
on  a  1  %  agarose  gel  containing  1 X  MOPS  buffer,  0.74%  (v/v)  formalde¬ 
hyde,  1.9  mg/ml  iodoacetamide  and  0.5  |ig/ml  ethidium  bromide. 
Electrophoresis  was  carried  out  in  IX  MOPS  buffer  at  5  V/cm. 
RNA  was  transferred  to  BrightStar-Plus  membrane  (Ambion)  using 
TurboBlotter  (Schleicher  &  Schuell)  and  Ambion  RNA  transfer  buffer. 
RNA  was  crosslinked  to  the  air-dried  membranes  using  254  nm  light 
(Ausubel  et  al,  1992). 

DNA  probes  were  prepared  by  the  random  priming  method  (Ausubel 
et  al,  1992)  using  [^-PJdCTP  and  a  DNA  labeling  kit  (Pharmacia). 
Hybridization  was  carried  out  for  8-16  h  at  42°C  in  Prehybridization/ 
Hybridization  Solution  (Ambion).  Filters  were  washed  according  to  the 
manufacturer’s  protocol  and  subjected  to  autoradiography. 

Gel  shift  assay 

PCR  was  used  to  extend  the  Cup9p  ORF  with  a  sequence  encoding 
Ser-Gly-Gly-Thr-Hise,  yielding  Cup9p-H6,  and  to  engineer  flanking 
restriction  sites  (Ndel  and  BamHl)  for  insertion  into  pET-llc  (Novagen). 
Cup9p-H6  was  overexpressed  in  E.coli  BL21  (DE3)  (Novagen)  (Ausubel 
et  al,  1992)  and  purified  on  a  3  ml  Ni-NTA  column  (Qiagen),  using  a 
linear  gradient  of  imidazole.  Cup9p-H6  eluted  at  -0.25  M  imidazole 
(-90%  pure  at  this  step);  it  was  dialyzed  at  4°C  against  10%  glycerol, 
0.1  M  KCl,  1  mM  EDTA,  0.5  mM  dithiothreitol,  20  mM  HEPES, 
pH  7.9,  and  then  snap-frozen  in  multiple  samples  in  liquid  N2,  and 
stored  at  -80°C.  The  proximal  (-1  to  -447)  and  distal  (-448  to  -897) 
PTR2  promoter  probes  for  the  gel  shift  assay  were  constructed  by  PCR 
amplification  in  the  presence  of  [a-^^P]dCTT,  and  were  purified  using 
spin  columns  (Qiagen).  The  gel  shift  reactions  (20  ^1)  contained 
50  |ig/ml  poly-dl  dC  (Pharmacia);  -1.5  |xg/ml  (500  c.p.m.)  DNA  probe; 

1  mg/ml  acetylated  serum  albumin  (New  England  Biolabs)  and  either 
0.1,  0.5  or  1  |Xg/ml  of  Cup9p-H6  in  10%  glycerol,  0.1  M  KCl,  2.5  mM 
MgCl2,  1  mM  EDTA,  20  mM  HEPES,  pH  7.9.  The  samples  were 
incubated  for  30  min  at  room  temperature,  then  loaded  onto  a  4% 


polyacrylamide  gel  (40:1.  acrylamide:Z7/.?-acrylamide)  in  0.5 X  TBE 
(Ausubel  et  al.,  1992).  and  electrophoresed  at  10  V/cm  for  3  h  at  4‘’C. 
followed  by  autoradiography. 

Pulse-chase  analysis  of  Cup9p 

One  hundred  ml  cultures  of  S.cerevisiae  JD52  (UBRI)  and  JD55  (ubrlA) 
carrying  either  pCB2I0  (expressing  Cup9p-FLAG  from  the  CUP9 
promoter)  or  pCB200  (vector  alone)  were  grown  to  an  A^oo  of  -  I  in 
SD(-Leu)  medium.  Cells  (50  units  total)  from  each  of  the  four  cul¬ 
tures  were  gently  pelleted  by  centrifugation,  washed  with  5  ml  of  SD(- 
Leu),  pelleted  again,  resuspended  in  2  ml  of  SD(-Leu),  and  incubated 
at  30®C  for  10  min.  Each  sample  was  labeled  for  5  min  with  1.4  mCi 
of  [-^-‘^SJmethionine/cysteine  (EXPRESS,  New  England  Nuclear)  at  30°C, 
followed  by  pelleting  in  a  microfuge  for  - 15  s.  The  cells  were  resuspended 
in  2.6  ml  of  SD(-Leu),  5  mM  L-methionine,  5  mM  L-cysieine,  and 
incubated  at  30®C.  Samples  of  0.5  ml  were  withdrawn  during  the 
incubation,  pelleted  and  resuspended  in  0.15  ml  of  0.5  M  NaCl-Lysis 
Buffer  (1%  Triton  X-lOO,  0.5  M  NaCl,  5  mM  EDTA,  50  mM  Na- 
HEPES,  pH  7.5)  containing  a  mixture  of  protease  inhibitors  (Ghislain 
et  al,  1996).  Glass  beads  (0.5  mm)  were  added,  and  cells  were  disrupted 
by  vortexing  (six  times,  for  30  s  each,  with  1  min  incubations  on  ice  in 
between),  followed  by  the  adjustment  of  NaCl  concentration  to  0.15  M 
through  the  addition  of  75  mM  NaCl-Lysis  Buffer,  further  vortexing  for 
30  s.  and  centrifugation  at  12  000  g  for  10  min.  The  volumes  of 
supernatants  were  adjusted  to  equalize  the  amounts  of  10%  trichloroacetic 
acid-insoluble  Cup9p-FLAG  was  immunoprecipitated  by  the  addition 
of  20  pi  of  the  monoclonal  anti-FLAG  M2  antibody  conjugated  to 
agarose  beads  (Kodak).  Suspensions  were  incubated  at  0°C  for  I  h,  with 
rotation,  then  centrifuged  at  12  000  g  for  30  s.  and  washed  four  times 
with  0.8  ml  of  0.15  M  NaCl-Lysis  Buffer.  The  pellets  were  resuspended 
in  SDS-sample  buffer,  heated  at  100®C  for  3  min.  and  subjected  to  SDS- 
12%  PAGE,  followed  by  autoradiography  and  quantitation  using  a 
Phosphorlmager  (Molecular  Dynamics). 

Pulse-chase  analysis  of  GST-Cup9p-ha2  was  carried  out  as  described 
by  Bartel  et  al  (1990).  Approximately  10  A^oo  units  of  galactose-induced 
cells  were  labeled  for  5  min  with  0.3  mCi  of  [-^‘’S] EXPRESS  in  400  pi 
SM-galactose  (-Ura)  at  30®C.  The  cells  were  then  transferred  to 
microfuge  tubes,  pelleted  and  resuspended  in  500  pi  of  SD  (-Ura), 
5  mM  L-methionine,  5  mM  L-cysteine.  Samples  of  0. 1  ml  were  withdrawn 
during  the  incubation,  pelleted  and  lysed  as  above.  The  -^''S-labeled  GST- 
Cup9p-ha2  was  purified  using  glutathione-agarose  beads  (Sigma)  which 
had  been  blocked  with  bovine  serum  albumin  (BSA;  10  mg/ml).  Twenty 
pi  of  glutathione-agarose  beads  were  added  to  each  sample  and  the 
suspensions  were  incubated  at  0®C  for  60  min,  with  rotation,  followed 
by  washes  and  electrophoretic  analyses  as  described  for  Cup9p-FLAG. 

Purification  and  N-terminal  sequencing  of  CupSp-FLAG 

Four  2-1  cultures  of  JD55  (ubrlA)  carrying  pCB116  that  expressed 
Cup9p-FLAG  from  the  GALJ  promoter  were  grown  under  selection  in 
SM-raffinose  to  an  Agoo  of  -^0.8,  followed  by  transfer  to  SM-galactose 
and  incubation  at  30'’C  for  3  h.  Longer  induction  times  resulted  in  a 
Cup9p-mediated  cytotoxicity  and  lower  yields  of  Cup9p-FLAG.  The 
cells  (-1X10^^)  were  harvested  and  lysed  at  4®C  as  described  by  Burgers 
(1995).  The  extract  was  fractionated  by  precipitation  with  0.4%  Polymin  P 
(Sigma)  and  then  further  by  precipitation  with  48%  saturated  ammonium 
sulfate.  The  pellet  was  dissolved  in  3  ml  of  TBS  buffer  (0.15  M  NaCl, 
50  mM  Tris-HCl,  pH  7.5),  and  passed  through  Sephadex  G-25  in  TBS. 
The  resulting  sample  (8.3  ml,  -70  mg/ml  of  protein)  was  applied  to  a 
column  (1  ml)  of  the  monoclonal  anti-FLAG  M2  antibody  (Kodak).  The 
column  was  washed  three  times  with  10  ml  of  TBS,  and  Cup9p-FLAG 
was  eluted  by  the  addition  of  five  1-ml  samples  of  TBS  containing, 
respectively,  50,  100,  100,  200  and  200  pg/ml  of  the  FLAG  peptide 
(Kodak).  Peak  Cup9p-FLAG  fractions  (detected  by  immunoblotting) 
were  concentrated  by  partial  lyophilization,  followed  by  precipitation 
with  methanol  (Wessel  and  Flugge,  1984)  in  the  presence  of  human 
insulin  (Sigma,  0.3  mg/ml)  as  a  carrier.  The  resulting  sample  was 
fractionated  by  SDS-12%  PAGE  and  electroblotted  onto  Pro-Blot 
membrane  (Perkin-Elmer).  After  a  brief  staining  with  Coomassie,  the 
band  of  the  37  kDa  Cup9p-FLAG  (-15  pmol)  was  excised  from  the 
membrane.  Half  of  the  sample  was  used  to  determine  the  amino  acid 
composition;  the  other  half  was  subjected  to  N-terminal  sequencing  for 
seven  cycles,  using  the  Applied  Biosystems  476A  protein  sequencer  at 
the  Caltech  Microchemistry  Facility. 
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The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-terminal  residue. 
Similar  but  distinct  versions  of  the  N-end  rule  operate  in  all  organisms  examined,  from  mammals 
to  fungi  and  bacteria.  In  eukaryotes,  the  N-end  rule  pathway  is  a  part  of  the  ubiquitin  system. 
Ubiquitin  is  a  76-residue  protein  whose  covalent  conjugation  to  other  proteins  plays  a  role  in 
many  biological  processes,  including  cell  growth  and  differentiation.  I  discuss  the  current 
understanding  of  the  N-end  rule  pathway. 


introduction 

Many  intracellular  proteins  are  metabolically  unstable, 
or  can  become  unstable  during  their  lifetime  in  a  cell. 
The  functions  of  intracellular  proteolysis  include  the 
elimination  of  abnormal  proteins,  the  maintenance  of 
amino  acid  pools  in  cells  affected  by  stresses  such  as 
starvation,  and  the  generation  of  protein  fragments  that 
act  as  hormones,  antigens  or  other  effectors.  Yet  another 
function  of  proteolytic  pathways  is  the  selective 
destruction  of  proteins  whose  concentrations  must 
vary  with  time,  and  alterations  in  the  state  of  a  cell. 
Metabolic  instability  is  a  property  of  many  regulatory 
proteins.  A  short  in  vivo  half-life^  of  a  regulator  provides 
a  way  to  generate  its  spatial  gradients  and  allows  for 
rapid  adjustments  of  its  concentration  (or  subunit 
composition)  through  changes  in  the  rate  of  its 
synthesis.  A  protein  can  also  be  conditionally  unstable, 
i.e.,  long-lived  or  short-lived  depending  on  the  state  of 
a  cell.  Conditionally  short-lived  regulators  are  often 
deployed  as  components  of  control  circuits.  One 
example  is  cyclins — a  family  of  related  proteins  whose 
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®  The  in  vivo  degradation  of  many  short-lived  proteins, 
including  the  engineered  N-end  rule  substrates,  deviates 
from  first-order  kinetics  (Baker  &  Varshavsky  1991).  There¬ 
fore  the  term  'half-life',  if  applied  to  an  entire  decay  curve,  is 
a  useful  but  often  crude  approximation.  A  more  rigorous 
terminology  for  describing  nonexponential  decay  was 
proposed  by  Levy  et  a/.  (1 996). 


destruction  at  specific  stages  of  the  cell  cycle  regulates 
cell  division  and  growth  (Murray  &  Hunt  1993).  In 
addition,  many  proteins  are  long-lived  as  components  of 
larger  complexes  such  as  ribosomes  and  oligomeric 
proteins,  but  are  metabolically  unstable  as  free  subunits. 
The  short  in  vivo  lifetimes  of  free  subunits  allow  for  a  less 
stringent  control  over  the  relative  rates  of  their 
synthesis,  since  a  subunit  produced  in  excess  would 
not  accumulate  to  a  significant  level. 

Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals,  or  degrons  (Varshavsky 
1991).  The  essential  component  of  one  degradation 
signal — the  first  to  be  discovered — is  a  destabilizing 
N-terminal  residue  of  a  protein  (Bachmair  et  al,  1986; 
Varshavsky  1992,  1996a).  This  signal  is  called  the 
N-degron.  A  set  of  N-degrons  containing  different 
destabilizing  residues  yields  a  rule,  termed  the  N-end 
rule,  which  relates  the  in  vivo  half-life  of  a  protein  to  the 
identity  of  its  N-terminal  residue  (Table  1  and  Fig.  1). 
The  N-end  rule  pathway  is  present  in  all  organisms 
examined,  including  the  bacterium  Escherichia  coli 
(Tobias  et  al.  1991;  Shrader  et  al  1993),  the  yeast 
(fungus)  Saccharomyces  cerevisiae  (Bachmair  &  Varshavsky 
1989),  and  mammalian  cells  (Gonda  et  al  1989;  Levy 
et  al  1996)  (Fig.  1). 

The  N-end  rule  was  encountered  in  experiments  that 
explored  the  metabolic  fate  of  a  fusion  between  Ub  and 
a  reporter  protein  such  as  E.  coli  jS-galactosidase  (iSgal)  in 
S.  cerevisiae  (Bachmair  et  al  1986).  In  yeast  and  other 
eukaryotes,  Ub-X-jSgal  is  cleaved,  cotranslationally  or 
nearly  so,  by  Ub-specific  processing  proteases  at  the 
Ub-jSgal  junction.  This  cleavage  takes  place  regardless 
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Figure  1  Comparison  of  eukaryotic  and  bacterial  N-end  rules.  Open  circles  denote  stabilizing  residues.  Purple  and  red  circles  denote, 
respectively,  type  1  and  type  2  primary  desubiUzing  residues.  Blue  triangles  denote  secondary  destabilizing  residues.  Green  crosses 
denote  tertiary  destabilizing  residues  (Varshavsky  1996a).  A  question  mark  at  Pro  indicates  its  uncertain  status  (see  the  mam  text).  A 
question  mark  above  Ser  indicates  its  uncertain  status  in  the  reticulocyte  N-end  rule  (Levy  et  al.  1996).  Single-letter  abbrevianons  for 
amino  acids;  A,  Ala;  C,  Cys;  D,  Asp;  E,  Glu;  F,  Phe;  G,  Gly;  H,  His;  I,  He;  K,  Lys;  L,  Leu;  M,  Met;  N,  Asn;  P,  Pro;  Q,  Gin;  R,  Arg;  S,  Ser; 
T,  Thr;  V,  Val;  W,  Trp;  Y,  Tyr. 


of  the  identity  of  the  residue  X  at  the  C-terminal  side  of 
the  cleavage  site,  proline  being  the  single  exception.  By 
allovving  a  bypass  of  the  normal  N-terminal  processing 
of  a  newly  formed  protein,  this  finding  (Fig.  2A) 
yielded  an  in  vivo  method  for  generating  different 
residues  at  the  N-termini  of  otherwise  identical 
proteins — a  technical  advance  that  led  to  the  N-end 
rule  (Varshavsky  1992,  1996a). 

In  eukaryotes,  the  N-degron  comprises  at  least  two 
determinants:  a  destabilizing  N-terminal  residue  and  an 
internal  lysine  (or  lysines)  of  a  substrate  (Fig.  2B) 


(Bachmair  &  Varshavsky  1989;  Johnson  et  al.  1990; 
Dohmen  et  al.  1994).  The  Lys  residue  is  the  site  of 
formation  of  a  multiubiquitin  chain  (Chau  et  al.  1989). 
Ubiquitin  (Ub)  is  a  76-residue  protein  whose  covalent 
conjugation  to  other  proteins  is  involved  in  a  multitude 
of  processes — cell  growth  and  differentiation,  signal 
transduction,  DNA  repair,  transmembrane  traffic,  and 
responses  to  stress,  including  the  immune  response.  In 
many  of  these  settings,  Ub  acts  through  routes  that 
involve  the  processive  degradation  of  Ub-protein  con¬ 
jugates  (Hershko  1991;Jentsch  1992;  Varshavsky  1996a). 
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Table  1  The  N-end  rule  in  £.  colt  and  S.  cerevisiae 


Residue  X 

Half-life  of  X-j8gal 

E.  coli 

S.  cerevisiae 

Arg 

2min 

2  min 

Lys 

2min 

3  min 

Phe 

2  min 

3  min 

Leu 

2  min 

3  min 

Trp 

2  min 

3  min 

Tyr 

2  min 

10  min 

His 

>  lOh 

3  min 

He 

>10h 

30  min 

Asp 

>10h 

3  min 

Glu 

>10h 

30  min 

Asn 

>10h 

3min 

Gin 

>10h 

lOmin 

Cys 

>10h 

>30h 

Ala 

>10h 

>30h 

Ser 

>10h 

>30h 

Thr 

>10h 

>30h 

Gly 

>10h 

>30h 

Val 

>10h 

>30h 

Pro 

a 

>5h 

Met 

>10h 

>30h 

Approximate  in  vivo  half-lives  of  X-jSgal  proteins  in  E.  coli  at 
36  ®C  (Tobias  et  al  1991)  and  in  S.  cerevisiae  at  30  ®C  (Bachmair  & 
Vanhavsky  1989).  A  question  mark  at  Pro  indicates  its  uncertain 
status  in  the  N-end  rule  (see  the  main  text). 


The  binding  of  an  N-end  rule  substrate  by  a  targeting 
complex  is  followed  by  the  formation  of  a  substrate- 
linked  multi-Ub  chain  (Dohmen  et  al.  1991).  The 
ubiquitylated  substrate  is  processively  degraded  by  the 
26S  proteasome — an  ATP-dependent,  multisubunit 
protease  (Rechsteiner  et  al  1993).  The  N-end  rule 
pathway  is  present  in  both  the  cytosol  (Bachmair  et  al 
1986)  and  the  nucleus  (J.A.  Johnston  &c  A.V., 
unpublished  data).  In  this  paper,  I  summarize  the 
current  understanding  of  the  N-end  rule.  For  a  more 
detailed  review,  see  Varshavsky  et  al.  (1997). 

Components  and  evolution  of  the  N-end 
rule  pathway 

The  N-end  rule  is  organized  hierarchically.  In  S.  cerevisiae, 
Asn  and  Gin  are  tertiary  destabilizing  N-terminal 
residues  in  that  they  function  through  their  conver¬ 
sion,  by  enzymatic  deamidation,  into  the  secondary 
destabilizing  N-terminal  residues  Asp  and  Glu,  whose 
activity  requires  their  conjugation,  by  Arg-tRNA- 
protein  transferase  (R-transferase),  to  Arg,  one  of  the 
primary  destabilizing  N-terminal  residues  (Gonda  et  al. 
1989;  Baizi  et  al  1990;  Baker  &  Varshavsky  1995).  The 
primary  destabilizing  residues  are  bound  directly  by 
N-recognin  (also  called  E3),  the  recognition  component 
of  the  N-end  rule  pathway  (Fig.  3), 

N-recognin  (E3) 

In  S.  cerevisiae,  N-recognin  is  a  225  kDa  protein 
(encoded  by  UBR1)  that  selects  potential  N-end  rule 


Figure  2  Mechanics  of  the  N-end  rule.  (A)  The  ubiquitin  fusion  technique.  Linear  fusions  of  Ub  to  other  proteins  are  cleaved  at  the 
last  residue  of  Ub,  making  it  possible  to  produce  different  residues  at  the  N-termini  of  otherwise  identical  proteins  (Bachmair  et  al  1 986: 
Gonda  et  al  1989).  Amino  acid  residues  in  blue  and  red  are  stabilizing  and  destabilizing,  respectively,  in  the  S.  cerevisiae  N-end  rule 
(Bachmair  &  Varshavsky  1989).  (B)  The  two-determinant  organization  of  eukaryotic  N-degrons.  d,  a  destabilizing  N-terminal  residue. 
A  chain  of  black  ovals  linked  to  the  second-determinant  lysine  (K)  denotes  a  multi-Ub  chain.  (C)  Cis  recognition  of  the  N-degron  in 
one  subunit  of  a  dimeric  protein.  The  other  subunit  bears  s,  a  stabilizing  N-terminal  residue.  (D)  Trans  recognition,  in  which  the  first  (d) 
and  second  (K)  determinants  of  the  N-degron  reside  in  different  subunits  of  a  dimeric  protein  (Johnson  er  al.  1990).  (E)  The  hairpin 
insertion  model.  A  targeted  N-end  rule  substrate  (in  green)  bearing  a  multi-Ub  chain  is  shown  bound  to  the  26$  proteasome  through 
the  chain.  The  position  of  a  targeting  complex  containing  N-recognin  is  unknown,  and  is  left  unspecified.  Only  the  20$  core 
component  of  the  26$  proteasome  is  shown.  A  red  arrow  indicates  the  direction  of  net  movement  of  the  substrate  s  polypeptide  chain 
toward  active  sites  in  the  interior  of  proteasome.  By  analogy  with  the  arrangement  of  signal  sequences  during  transmembrane 
translocation  of  proteins  (Schatz  &  Dobberstein  1996),  it  is  proposed  that  a  region  of  the  substrate  upstream  of  its  ubiquitylated  lysine  (K) 
does  not  move  through  the  proteasome  during  the  substrate’s  degradation,  and  may  be  released  intact  following  a  cleavage  downstream 
of  the  lysine.  Variants  of  this  model  may  also  be  relevant  to  the  targeting  of  proteins  that  bear  internal  or  C-terminal  degrons.  (F)  A 
model  for  the  recognition  of  an  N-end  rule  substrate  (Bachmair  &  Varshavsky  1989).  The  reversible  binding  of  N-recognin  to  a  primary 
destabilizing  N-terminal  residue  (d)  of  a  substrate  (step  I)  must  be  followed  by  a  capture  of  the  second-determinant  lysine  (K)  of  the 
substrate  by  a  targeting  complex  containing  a  Ub-conjugating  (E2)  enzyme  (step  11).  It  is  unknown  whether  the  lysine  is  captured  by  E2 
(as  shown  here)  or  by  N-recognin.  Ubiquirylation  of  the  substrate  commences  once  the  targeting  complex  is  bound  to  both 
determinants  of  the  N-degron  (step  HI).  This  model  does  not  specify,  among  other  things,  the  details  of  Ub  conjugation  (see  the  main 
text). 
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Figure  3  The  S.  cerevhiae  N-end  rule  pathway.  Type  1  and  type  2  primary  destabilizing  N-terminal  residues  are  in  purple  and  red, 
respectively  Secondary  and  tertiary  destabilizing  N-terminal  residues  are  in  blue  and  green,  respectively  The  yeUow  ovals  denote  *e 
rest  of  a  protein  substrate.  The  conversion  of  tertiary  destabilizing  residues  N  and  Q  into  secondary  destabilizing  residues  D  and  E  is 
mediated  by  N-terminal  amidohydrolase  (Nt-amidase),  encoded  by  NTAl.  The  conjugation  of  the  primary  destabihzmg  residue  R  to 
the  secondary  destabilizing  residues  D  and  E  is  mediated  by  Atg-tRNA-protein  transferase  (R-transferase),  encoded  by  ^ 

complex  of  N-recognin  and  the  Ub-conjugating  (E2)  enzyme  Ubc2p  catalyses  the  conjugation  of  activated  Ub,  bV 

Ub-activating  (El)  enzyme  Ubalp,  to  a  Lys  residue  of  the  substrate,  yielding  a  substrate-linked  multi-Ub  chain.  Ubalp~Ub  and 
Ubc2p~Ub  denote  covalent  (thioester-mediated)  complexes  of  these  enzymes  with  Ub.  A  multiubiquitylated  substrate  is  degraded  by 
the  26S  proteasome.  Inset:  A  model  of  the  targeting  complex.  The  20  kDa  Ubc2p  E2  enzyme  is  depicted  carrying  actirated  linked  to 
Cys-88  of  Ubc2p  through  a  thioester  bond.  Both  the  52  kDa  Ntalp  (Nt-amidase)  and  the  58  kDa  Atelp  (R-transferase)  bind  to  the 
225  kDa  Ubrlp  (N-recognin)  in  proximity  to  the  type  1  substrate-binding  site  of  Ubrlp.  In  addition,  Ntalp  directly  interacts  with  Atelp 

(see  the  main  text). 


Figure  4  The  E.  coli  N-end  rule  pathway.  Primary  destabilizing 
N-terminal  residues  L,  F,  W  and  Y  are  in  red.  Secondary 
destabilizing  N-terminal  residues  R  and  K  are  in  blue.  The 
yellow  ovals  denote  the  rest  of  a  protein  substrate.  Conjugation  of 
the  primary  destabilizing  residue  L  to  the  secondary  destabilizing 
residues  R  and  K  is  mediated  by  Leu,  Phe-tRNA-protein 
transferase  (L/F-transferase),  encoded  by  oat  (Tobias  et  al.  1991). 
In  vivo,  L/F-transferase  appears  to  conjugate  predominantly  if  not 
exclusively  L  (Shrader  et  al.  1993).  The  degradation  of  a  substrate 
bearing  a  primary  destabilizing  N-terminal  residue  is  carried  out 
by  the  ATP-dependent  protease  ClpAP,  encoded  by  clpA  and 
clpP.  A  question  mark  denotes  an  ambiguity  about  the  nature  of 
N-recognin  in  E.  coli. 


16  Genes  to  Cells  (1997)  2,  13-28 


©  BlackweU  Science  Limited 


The  N-end  rule 


substrates  through  the  binding  to  their  primary 
destabilizing  N-terminal  residues  Phe,  Leu,  Trp,  Tyr, 
He,  Arg,  Lys  or  His  (Bartel  et  al  1990;  Varshavsky 
1996a).  N-recognin  has  at  least  two  substrate-binding 
sites.  The  type  1  site  is  specific  for  the  basic  N-terminal 
residues  Arg,  Lys  and  His.  The  type  2  site  is  specific  for 
the  bulky  hydrophobic  N-terminal  residues  Phe,  Leu, 
Trp,  Tyr  and  He  (Fig.  3).  At  present,  these  sites  are 
defined  through  dipeptide-based  competition  experi¬ 
ments.  Specifically,  a  dipeptide  bearing  a  destabilizing 
N-terminal  residue  was  found  to  inhibit  the  degradation 
of  a  test  N-end  rule  substrate  if  that  substrate’s 
N-terminal  residue  was  of  the  same  type  (1  or 
2)  as  the  dipeptide’s  N-terminal  residue  (Reiss  et  al  1988; 
Gonda  et  al  1989). 

A  genetic  dissection  of  the  type  1  and  type  2  sites  in  S. 
cerevisiae  N-recognin  (Ubrlp)  has  shown  that  either  of  the 
sites  can  be  mutationally  inactivated  without  significandy 
perturbing  the  other  site.  Mutations  that  selectively 
inactivate  the  type  1  or  the  type  2  site  are  located 
within  the  «50kDa  N-terminal  region  of  the  225  kDa 
N-recognin  (A.  Webster,  M.  Ghislain  &  A.  V,  unpub¬ 
lished  data).  E3a,  the  mammalian  counterpart  of  S. 
cerevisiae  N-recognin,  has  been  characterized  biochemi¬ 
cally  in  extracts  from  rabbit  reticulocytes  (Henhko  1991). 
Another  mammalian  N-recognin,  termed  £3)8,  which 
apparendy  binds  to  substrates  bearing  N-terminal  ^  and 
Thr  (and  possibly  also  Ser  (Levy  et  al,  1996)),  has  been 
described  as  well  (Hershko  1991). 

All  eukaryotes  examined  have  both  Ub  and  the 
N-end  rule  pathway.  Some,  but  not  all,  prokaryotes 
contain  Ub  (Wolf  et  al  1993).  The  bacterium  E.  coli 
lacks  Ub  but  does  have  an  N-end  rule  pathway  (Fig.  4) 
(Tobias  et  al  1991).  Screens  for  mutations  that  inactivate 
either  the  entire  N-end  rule  pathway  or  its  subset  have 
identified  three  £.  coli  genes— clpA,  clpP,  and  aat 
(Shrader  et  al  1993).  Aat  is  a  Leu,  Phe-tRNA-protein 
transferase  (L/F-transferase).  ClpA  (81  kDa)  and  ClpP 
(21  kDa)  form  a  *^750  kDa  complex,  ClpAP,  which 
exhibits  ATP-dependent  protease  activity  in  vitro 
(Gottesman  &  Maurizi  1992),  and  is  a  functional 
counterpart  of  the  eukaryotic  26S  proteasome  in  the  E. 
coli  N-end  rule  pathway  (Fig.  4). 

ClpP  exhibits  a  chymotrypsin-like  protease  activity  in 
vitro  (Gottesman  &  Maurizi  1992).  ClpA  is  the  ATP- 
binding  component  of  ClpAP.  In  vitro  studies  have 
shown  that  ClpA  can  act  as  a  chaperone  in  the  activation 
of  RepA,  the  replication  initiator  encoded  by  the 
plasmid  PI  (Wickner  et  al  1994).  In  vivo  ramifications 
of  these  resiJts,  and  in  particular  their  relevance  to  the 
proteolytic  function  of  ClpAP  in  the  E.  coli  N-end  rule 
pathway  (Fig.  4),  remain  to  be  examined.  ClpP 


associates  not  only  vdth  ClpA  (forming  ClpAP 
protease),  but  also  with  the  ClpA  homologues  ClpB 
or  ClpX,  forming,  respectively,  ClpBP  or  ClpXP 
proteases  (Gottesman  et  al  1993;  Wavrzynov  et  al 
1995).  In  contrast  to  ClpA,  whose  mutational  elimina¬ 
tion  stabilizes  the  normally  short-lived  N-end  rule 
substrates  (Tobias  et  al  1991),  the  elimination  of  either 
ClpB  or  ClpX  appears  not  to  perturb  the  E.  coli  N-end 
rule  pathway  (O.  Lomovskaya  &  A.  V.,  unpublished  data). 

N-terminal  amidases 

The  S.  cerevisiae  N-terminal  amidohydrolase 
(Nt-amidase),  encoded  by  NTA1,  is  a  52  kDa  enzyme 
which  deamidates  Asn  or  Gin  if,  and  only  if,  they  are 
located  at  the  N-terminus  of  a  polypeptide  (Figs  3  and 
5B)  (Baker  &  Varshavsky  1995;  Grigoryev  et  al  1996). 
Null  ntal  mutants  are  unable  to  degrade  N-end  rule 
substrates  that  bear  N-terminal  Asn  or  Gin.  The 
deduced  sequence  of  Ntalp  is  not  similar  to  those 
of  the  other  known  amidotransferases,  save  for  the 
sequence  Gly  Ile-Cys-Met  that  is  a  part  of  an  1 1 -residue 
region  conserved  among  some,  but  not  all,  amido¬ 
transferases.  The  conserved  cysteine  of  this  sequence  is 
required  for  the  enzymatic  activity  of  Ntalp  (Grigoryev 
et  al  1996). 

Stewart  et  al  (1995)  purified  a  porcine  Nt-amidase 
that  deamidates  N-terminal  Asn  (N)  but  not  Gin  (Q),  and 
isolated  a  cDNA  that  encodes  this  enzyme.  Grigoryev 
et  al  (1996)  isolated  and  characterized  a  ^17  kb  gene, 
termed  Ntanl,  that  encodes  a  mouse  homologue  of  the 
porcine  amidase,  termed  Nt^-amidase.  The  1.4  kb 
Ntani  mRNA  is  expressed  in  all  of  the  tested  mouse 
tissues  and  cell  lines.  The  recently  produced  ntanlA 
mouse  strains  are  viable  (Y.T.  Kwon  and  A.V,  unpub¬ 
lished  data).  Their  phenotypic  analysis  is  under  way. 

Both  Asn  and  Gin  are  destabilizing  residues  in 
the  mammalian  N-end  rule  (Fig.  1).  Further,  both 
N-terminal  Asn  and  Gin  of  the  test  proteins  are 
deamidated  in  mammalian  cell  extracts  (Gonda  et  al 
1989;  S.  Grigoryev  &  A.V,  unpublished  data).  Therefore 
there  must  exist  yet  another  mammalian  Nt-amidase 
(Nt^-amidase),  which  can  deamidate  N-terminal  Gin 
(Hg.  5A). 

Aminoacyl-tRNA-protein  transferases 

The  S.  cerevisiae  Arg-tRNA-protein  transferase 
(R-transferase),  encoded  by  ATEl,  is  a  58  kDa  enzyme 
that  utilizes  Arg-tRNA  to  arginylate  the  N-termini  of 
polypeptides  (but  not  free  amino  acids)  that  bear  Asp  or 
Glu  (Figs  3  and  5B).  Null  atel  mutants  are  unable  to 
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Figure  5  Comparison  of  enzymatic  reac¬ 
tions  that  underlie  the  activity  of  tertiary 
and  secondary  destabilizing  residues  in 
different  organisms.  (A)  Mouse  {Mus 
miisculus)  L-cells  and  rabbit  (Oryctolagus 
cuniculus)  reticulocytes  (Levy  et  al  1996; 
Gonda  et  al.  1989).  (B)  The  yeast  Sacchar¬ 
omyces  cerevisiae  (Bachmair  &  Varshavsky 
1989).  (C)  The  bacterium  Escherichia  coli 
(Tobias  et  al  1991).  The  E.  coli  N-end  rule 
lacks  tertiary  destabilizing  residues.  The 
postulated  mammalian  Nt^-amidase  (a 
question  mark  in  A)  remains  to  be 
identified. 


degrade  N-end  rule  substrates  that  bear  N-terminal 
Asn,  Gin,  Asp  or  Glu — the  tertiary  and  secondary 
destabilizing  N-terminal  residues  (Balzi  et  al.  1990).  In 
contrast  to  S.  cerevisiae,  where  only  Asp  and  Glu  are 
secondary  destabilizing  residues,  in  mammals,  Cys  is  a 
secondary  destabilizing  residue  as  well  (Gonda  et  al 
1989)  (Table  1  and  Fig.  1).  It  is  not  known  whether  the 
arginylation  of  N-terminal  Asp,  Glu  and  Cys  in 
mammals  is  catalysed  by  an  R-transferase  whose 
specificity  is  broader  than  that  of  its  yeast  counterpart, 
or  whether  N-terminal  Cys  is  arginylated  by  a  distinct 
R-transferase  (Fig.  5). 

Approximately  2h  after  a  crush  injury  to  the  rat 
sciatic  nerve,  an  extract  was  prepared  fi:om  a  segment  of 
the  nerve  immediately  upstream  of  the  crush  site.  This 
extract  was  found  to  conjugate  a  « 10-fold  higher 
amount  of  the  added  ^H-arginine  to  the  N-termini  of 
unidentified  endogenous  proteins  than  an  otherwise 
identical  extract  from  the  same  region  of  an  unper¬ 
turbed  sciatic  nerve  Payal  et  al  1990),  suggesting  a 
crush-induced  increase  in  the  level  of  N-end 
rule  substrates  and/or  a  post-crush  induction  of  the 
N-end  rule  pathway.  No  post-crush  increase  in 
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arginylation  was  observed  with  extracts  from  the 
optic  nerve,  which  does  not  regenerate  after  a  crush 
injury,  in  contrast  to  the  sciatic  nerve  (Dayal  et  al 
1990). 

R-transferase  appears  to  be  confined  to  eukaryotes, 
whereas  Leu,  Phe-tRNA-protein  transferase  (L/F- 
transferase)  is  present  in  bacteria  such  as  E.  coli  but  is 
apparendy  absent  from  eukaryotes.  E.  coli  L/F-transferase 
is  a  27  kDa  enzyme  encoded  by  the  gene  aat  (Shrader  et  al 
1993).  In  vivo,  L/F-transferase  conjugates  mainly  if  not 
exclusively  the  Leu  to  N-terminal  Arg  or  Lys  of  a 
polypeptide  substrate  (Shrader  et  al  1993)  (Fig.  4).  E.  coli 
mutants  lacking  aat  are  unable  to  degrade  N-end  rule 
substrates  that  bear  N-terminal  Ai^  or  Lys.  These  data 
(Tobias  et  al  1991)  identified  L/F-transferase  as  a 
component  of  the  E.  coli  N-end  rule  pathway. 

Ubiquidn-conjugating  enzymes 

The  initial  interaction  between  an  N-end  rule  substrate 
and  N-recognin  is  of  moderate  affinity  (the  inferred 
Kd  of  roughly  10 /4M;  Varshavsky  et  al  1997),  but 
becomes  much  stronger  if  an  internal  lysine  of  the 
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substrate  is  captured  by  a  targeting  complex  containing  a 
Ub-conjugating  (E2)  enzyme  and  N-recognin  (E3). 
This  capture  initiates  a  processive  synthesis  of  a  lysine- 
linked  multi-Ub  chain.  The  E2  enzymes  utilize 
activated  Ub,  produced  by  the  Ub-activating  (El) 
enzyme,  to  catalyse  the  formation  of  isopeptide  bonds 
benveen  the  C-terminal  Gly  76  of  Ub  and  e-amino 
groups  of  lysines  in  acceptor  proteins  (Fig.  3)  (Pickart 
1988;  Hershko  1991;  Jentsch  1992). 

In  at  least  some  Ub-dependent  systems  (Scheffiier 
et  al.  1995),  including  apparently  the  N-end  rule 
pathway  (V.  Chau  &  A.V.,  unpublished  data),  the 
pathway-specific  Ub  ligase — a  complex  of  a  recognin 
(E3)  and  an  E2  enzyme — shifts  the  activated  Ub  moiety 
(which  is  initially  linked  to  a  Cys  residue  of  the  El 
enzyme)  through  a  relay  of  Ub  thioesters  before 
conjugating  Ub  to  a  Lys  residue  of  a  targeted  substrate. 
In  a  substrate-linked  multi-Ub  chain,  the  C-terminal 
glycine  of  one  Ub  moiety  is  joined  to  an  internal 
lysine  of  the  adjacent  Ub  moiety,  resulting  in  a  chain 
of  Ub-Ub  conjugates.  In  a  multi-Ub  chain  linked  to  an 
N-end  rule  substrate,  only  Lys-48  of  Ub  was  found  to 
be  joined  to  another  Ub  moiety  within  a  chain  (Chau 
et  al.  1989).  Recently,  multi-Ub  chains  mediated  by 
Lys-63,  Lys-29,  Lys-11  or  Lys-6  of  Ub  have  been 
described  as  well  (Arnason  &  Ellison  1994;  Johnson 
et  al.  1995;  Spence  et  al.  1995;  Baboshina  Sc  Haas  1996). 
It  is  not  known  whether  these  chains  play  a  role  in  the 
N-end  rule  pathway. 

In  S.  cerevisiae,  the  ubiquitylation  of  N-end  rule 
substrates  requires  the  Ubc2p  E2  enzyme  (Dohmen  et 
al.  1991).  Processes  knovvm  to  be  perturbed  by 
mutations  in  UBC2  include  the  N-end  rule  pathway, 
DNA  repair,  induced  mutagenesis,  cell  cycle  control, 
and  sporulation  (Jentsch  1992,  and  references  therein). 
The  N-end  rule  pathway  is  inactive  in  both  ubrtA  and 
ubc2A  mutants  (Madura  et  al.  1993),  but  the  overall 
effect  of  ubc2A  on  cell  growth  and  sporulation  is  much 
more  severe  than  that  of  ubrlA,  indicating  that  the 
functions  of  Ubc2p  are  not  confined  to  the  N-end  rule 
pathway. 

The  N-end  rule  as  a  witness  of  evolution 

The  organization  of  N-end  rules,  with  their  tertiary, 
secondary  and  primary  destabilizing  residues,  is  a  feature 
more  conserved  in  evolution  than  either  the  Ub 
dependence  of  an  N-end  rule  pathway  or  the  identity 
of  enzymatic  reactions  that  mediate  the  hierarchy  of 
destabilizing  residues.  For  example,  in  a  bacterium  such 
as  E.  coli,  which  lacks  the  Ub  system,  the  N-end  rule  has 
both  secondary  and  primary  destabilizing  residues  (it 


lacks  tertiary  residues)  (Figs  1,  4  and  5C).  The  identities 
of  secondary  destabilizing  residues  in  E.  coli  (Arg  and 
Lys)  are  different  from  those  in  eukaryotes  (Figs  1  and 
5).  Bacterial  and  eukaryotic  enzymes  that  implement 
the  coupling  between  secondary  and  primary  residues 
are  also  different:  L/F-transferase  in  E.  coli  and 
R-transferase  in  eukaryotes.  Note,  however,  that 
bacterial  L/F-transferase  and  eukaryotic  R-transferase 
catalyse  reactions  of  the  same  type  (conjugation  of  an 
amino  acid  to  an  N-terminal  residue  of  a  polypeptide) 
and  utilize  the  same  source  of  activated  amino  acid 
(aminoacyl-tRNA)  (Fig.  5). 

The  apparent  confinement  of  R-transferase  to 
eukaryotes  and  of  L/F-transferase  to  prokaryotes 
suggests  that  secondary  destabilizing  residues  were 
recruited  late  in  the  evolution  of  N-end  rule,  after 
the  divergence  of  prokaryotic  and  eukaryotic  lineages. 
The  lack  of  sequence  similarity  between  the  yeast 
Nt-amidase  and  the  mammalian  Nt^-amidase,  as  well  as 
the  more  narrow  specificity  of  the  mammahan  enzyme 
(Fig.  5A,  B)  suggest  that  tertiary  destabilizing  residues 
Asn  and  Gin  became  a  part  of  the  N-end  rule  much 
later  yet,  possibly  after  the  divergence  of  metazoan  and 
fungal  lineages.  If  so,  the  N-end  rule  pathway  may  be  an 
especially  informative  witness  of  evolution:  the  ancient 
origins  of  this  proteolytic  system,  the  simplicity  and 
discreteness  of  changes  in  the  rule  books  of  N-end  rules 
among  different  species,  and  the  diversity  of  proteins 
that  either  produce  or  target  the  N-degron  should 
facilitate  phylogenetic  deductions — once  the  com¬ 
ponents  of  this  pathway  become  characterized  across  a 
broad  range  of  organisms. 

Code  vs.  hardware 

A  given  N-end  rule  is  defined  operationally — for  a  set 
of  proteins  such  as  X-jSgals  that  differ  exclusively  by 
their  N-terminal  residues.  Existing  evidence  (Bachmair 
&  Varshavsky  1989)  suggests  that  the  ranking  aspect  of 
an  N-end  rule,  i.e.,  an  ordering  of  relative  destabilizing 
activities  among  20  fundamental  amino  acids,  is 
invariant  from  one  protein  reporter  to  another  in  a 
given  intracellular  compartment.  (The  case  of  N-end 
rule  substrates  bearing  N-terminal  Pro  presents  an 
apparent  exception  to  this  conjecture;  see  below.)  By 
contrast,  the  actual  in  vivo  half-lives  may  differ  greatly 
among  different  proteins  bearing  one  and  the  same 
N-terminal  residue  (Bachmair  &  Varshavsky  1989). 
The  cause  of  these  differences  is  the  multicom¬ 
ponent  nature  of  underlying  N-degrons  (Fig.  2B). 
For  example,  in  eukaryotes,  an  N-degron  comprises 
not  only  a  destabilizing  N-terminal  residue  of  a 
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protein  but  also  its  internal  lysine  (or  lysines),  whose 
quality  as  a  determinant  can  range  from  high  to 
nonexistent. 

A  priori,  one  and  the  same  N-end  rule  can  be 
implemented  through  vastly  different  assortments  of 
targeting  hardware.  At  one  extreme,  each  destabilizing 
N-terminal  residue  may  be  bound  by  a  distinct 
N-recognin.  Conversely,  a  single  N-recognin  may  be 
responsible  for  the  entire  rule  book  of  destabilizing 
residues  in  a  given  N-end  rule.  The  actual  N-end 
rule  pathways  lie  between  these  extremes,  and  happen 
to  have  a  hierarchic  rather  than  ‘linear’  structure 
(Figs  3-5). 

Targeting  complex  of  the  N-end  rule 
pathway 

The  known  components  of  the  S.  cerevisiae  N-end  rule 
pathway  that  mediate  steps  prior  to  the  proteolysis  of 
a  targeted  substrate  by  the  26S  proteasome  are 
Nt-amidase  (Ntalp),  R-transferase  (Atelp),  N-recognin 
(Ubrlp),  a  Ub-conjugating  (E2)  enzyme  (Ubc2p),  and 
the  Ub-activating  (El)  enzyme  (Ubalp)  (Fig.  3) 
(Varshavsky  1996a),  In  addition  to  direct  (immuno- 
precipitation-based)  evidence  for  the  physical  associa¬ 
tion  between  N-recognin  and  Ubc2p  (Madura  et  ah 
1993),  there  is  also  circumstantial  (overexpression- 
based)  evidence  for  the  existence  of  a  complex  between 
N-recognin,  R-transferase  and  Nt-amidase  (Baker  & 
Varshavsky  1995).  Recently,  a  high-affmity  interaction 
between  Ntalp  and  Atelp  was  demonstrated  directly; 
other  data  suggest  that  both  Ntalp  and  Atelp  interact 
with  Ubrlp  (M.  Ghislain,  A.  Webster  &  A.V, 
unpublished  results).  In  a  quaternary  Ubc2p-Ubrlp- 
Ntalp-Atelp  complex  suggested  by  these  data,  Atelp 
and  Ntalp  interact  with  each  other  and  Ubrlp  (Fig.  3). 

Other,  perhaps  more  transient,  components  of  the 
targeting  complex  in  S.  cerevisiae  are  likely  to  include  the 
114  kDa  Ubalp  (El  enzyme,  which  must  be  bound  to 
Ubc2p  during  the  El— ^£2  transfer  of  activated  Ub 
moiety),  and  also  Arg-tRNA  synthetase.  The  latter 
possibility  is  suggested  by  the  finding  that,  in  mammals, 
Arg-tRNA  synthetase  (whose  product,  Arg-tRNA, 
is  a  co-substrate  of  R-transferase)  copurifies  with 
R-transferase  (Ciechanover  et  ah  1988).  It  is  also  likely 
that  the  targeting  complex  interacts  with  the  26S 
proteasome  in  vivo,  for  example,  during  the  transfer  of  a 
multiubiquitylated  N-end  rule  substrate  to  substrate¬ 
binding  sites  of  the  proteasome.  The  proteolytic  machine 
that  implements  the  N-end  rule  is  thus  a  strikingly 
diverse  assembly  of  enzymes  and  binding  factors  whose 
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total  mass  is  close  to  that  of  the  large  ribosomal  subunit. 
However,  even  a  transient  existence  of  this  ‘metacom¬ 
plex’  is  conjectural  at  present,  the  alternative  possibility 
being  a  sequential  formation  of  transient  subcomplexes 
that  produce  a  substrate-linked  multi-Ub  chain  and  relay 
a  substrate  toward  the  26S  proteasome. 

The  effects  of  overexpressing  Nt-amidase  and/or 
R-transferase  in  S.  cerevisiae  not  only  suggested  the 
existence  of  the  Ntalp- Ate Ip-Ubrlp-Ubc2p  complex 
but  also  led  to  the  prediction  that  Ntalp  and  Atelp  are 
associated  with  Ubrlp  in  proximity  to  its  type  1 
substrate-binding  site  (Fig.  3)  (Baker  &  Varshavsky 
1995).  The  ‘proximity’  aspect  of  the  postulated 
complex  was  invoked  to  account  for  the  markedly 
different  effects  of  overexpressed  R-transferase  on  the 
degradation  of  N-end  rule  substrates  bearing  type  1  vs. 
type  2  primary  destabilizing  N-terminal  residues  (Baker 
&  Varshavsky  1995),  In  the  diagram  of  Fig.  3,  the 
physical  proximity  of  the  bound  R-transferase  to  the 
type  1  site  of  N-recognin  is  presumed  to  decrease  the 
steric  accessibility  of  this  site  to  an  N-end  rule  substrate 
that  bears  a  type  ■  1  residue  such  as  Arg  and  approaches 
the  type  1  binding  site  of  N-recognin  directly  from 
the  bulk  solvent.  By  contrast,  a  substrate  that  acquired 
Arg  through  arginylation  by  the  N-recognin-bound 
R-transferase  would  be  able  to  reach  the  (nearby)  type  1 
binding  site  of  N-recognin  direcdy — without  dissociat¬ 
ing  into  the  bulk  solvent  first — a  feature  known  as 
substrate  ‘channeling’  in  multistage  enzymatic  reactions 
(Negrutskii  &  Deutscher  1991).  The  mechanics  of 
channeling  may  involve  the  diffusion  of  an  N-end  rule 
substrate  in  proximity  to  surfaces  of  the  targeting 
complex,  analogous  to  the  mechanism  of  a  bifiinctional 
enzyme  dihydrofolate  reductase-thymidylate  synthetase, 
where  the  channeling  of  dihydrofolate  apparendy  results 
from  its  movement  across  the  surface  of  the  protein 
(Knighton  et  ah  1994). 

The  N-degron  and  pre-N-degron 

Nascent  proteins  contain  N-terminal  Met  (fMet  in 
prokaryotes),  which  is  a  stabilizing  residue  in  the  known 
N-end  rules  (Fig.  1).  Thus,  the  N-degron  of  an  N-end 
rule  substrate  must  be  produced  from  a  pre-N-degron. 
In  an  engineered  N-end  rule  substrate,  a  pre-N-degron 
contains  the  N-terminal  Ub  moiety  whose  removal  by 
Ub-specific  proteases  yields  the  protein’s  N-degron 
(Fig.  2A).  This  design  of  a  pre-N-degron  is  unlikely  to 
be  relevant  to  physiological  N-end  rule  substrates, 
because  natural  Ub  fusions  (including  the  precursors  of 
Ub)  either  contain  a  stabilizing  residue  at  the 
Ub-protein  junction  or  bear  a  mutant  Ub  moiety  that  is 
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retained  in  vivo  (Ozkaynak  et  al.  1987;  Finley  et  al.  1989; 
Watkins  et  al  1993).  The  known  Met-aminopeptidases 
remove  N~terminal  Met  if,  and  only  if,  the  second 
residue  of  a  protein  is  stabilizing  in  the  yeast  N-end  rule 
(Fig,  1).  The  structural  basis  of  this  selectivity  is  the  size 
of  a  residue’s  side  chain  (Sherman  et  al  1985;  Arfin  & 
Bradshaw  1988;  Li  &  Chang  1995),  Specifically,  the  side 
chains  of  the  residues  that  are  destabilizing  in  the  yeast 
N-end  rule  are  larger  than  those  of  stabilizing  residues. 
The  exception  is  Met — a  bulky  hydrophobic  but 
stabilizing  residue  (Fig.  1). 

Can  there  be  just  one  or  a  few  residues  between 
N-terminal  Met  and  the  site  of  cleavage  that  produces 
an  N-degron?  If  so,  a  short  (<10  residues)  N-terminal 
sequence  might  contain  both  the  recognition  motif  and 
the  cleavage  site(s)  for  a  relevant  (unknown)  processing 
protease.  Screens  for  such  sequences,  carried  out  in  S. 
cerevisiae  (Sadis  et  al  1995;  Ghislain  et  al  1996),  did 
identify  short  (^10  residues)  N-terminal  regions  that 
conferred  Ubrlp-dependent  metabolic  instability  on  a 
reporter  protein.  Most  of  the  sequences  identified  by 
these  screens  were  not  similar  to  each  other,  possibly 
because  a  very  large  number  of  10-residue  N-terminal 
extensions  can  produce  an  N-degron  in  vivo,  analogous 
to  a  large  number  of  N-terminal  sequences  that  can 
function  as  signals  for  protein  translocation  across  the 
ER  membrane  (Kaiser  et  al  1987). 

Analysis  of  one  N-terminal  extension  identified  by 
Ghislain  et  al  (1996)  has  shown  that  it  targets  a  reporter 
protein  for  degradation  while  retaining  its  N-terminal 
Met  (M.  Gonzalez,  F.  Levy,  M.  Ghislain  &  A.V, 
unpublished  data).  This  finding  suggests  that 
N-recognin  binds  not  only  to  N-degron$  but  also  to  a 
degron  that  consists  of  an  entirely  internal  sequence 
motif.  By  contrast,  two  other  examined  (direcdy 
sequenced)  extensions  were  found  to  be  cleaved  in  vivo 
after  N-terminal  Met,  yielding  destabilizing  N-terminal 
residues  (Sadis  et  al  1995;  Ghislain  et  al  1996).  In  sum, 
we  are  just  beginning  to  understand  the  processing 
reactions  that  yield  a  destabilizing  N-terminal  residue  in 
a  non-polyprotein  context. 

Mechanics  of  N-degron 

Stochastic  capture  model 

Studies  with  jSgal-  and  DHFR-based  N-end  rule 
substrates  (Bachmair  &  Varshavsky  1989;  Chau  et  al 
1989;  Johnson  et  al  1990;  Hill  et  al  1993)  suggested  a 
stochastic  view  of  the  N-degron,  in  which  specific 
lysines  of  an  N-end  rule  substrate  can  be  assigned  a 
probability  of  being  utilized  as  a  ubiquitylation  site.  This 


probability  depends  on  the  time-averaged  spatial 
position  and  mobility  of  a  protein’s  lysine.  For  some, 
and  often  for  most  of  the  lysines  in  an  N-end  rule 
substrate,  the  probability  of  serving  as  a  ubiquitylation 
site  would  be  negligible  because  of  the  lysine’s  lack 
of  mobility  and/or  its  distance  from  a  destabilizing 
N-terminal  residue.  In  this  ‘stochastic  capture’  model 
(Fig.  2F),  the  folded  conformation  of  a  substrate  would 
be  expected  to  slow  down  or  preclude  the  search  for  a 
Lys  residue,  unless  it  is  optimally  positioned  in  the 
folded  substrate. 

The  bipartite  design  of  N-degron  (Fig.  2B)  is  also 
likely  to  be  characteristic  of  other  Ub-dependent 
degradation  signals — present  in  a  multitude  of 
naturally  short-lived  proteins  that  include  cyclins 
(Murray  &  Hunt  1993),  IkBa,  and  c-Jun  (Pahl  & 
Baeuerle  1996).  The  first  component  of  these  degrons 
is  the  internal  region  of  a  protein  (instead  of  its 
N-terminal  residue)  that  is  specific  for  each  degrada¬ 
tion  signal.  The  second  component  is  an  internal 
lysine  (or  lysines).  A  degron  may  also  contain 
regulatory  determinants  whose  modification  (e.g. 
phosphorylation/dephosphorylation)  can  modulate  the 
activity  of  this  degron  (Pahl  &  Baeuerle  1996;  Nishizawa 
et  al  1993). 

Cis^trans  recognition  and  subunit-specific 
degradation  of  oligomeric  proteins 

The  two  determinants  of  N-degron  can  be  recognized 
either  in  as  or  in  trans  (Fig.  2C,  D)  (Johnson  et  al  1990; 
F.  Levy  &  A.  V,  unpublished  data).  Experiments  that 
revealed  the  frafzs-recognition  have  also  brought  to  light 
a  remarkable  feature  of  the  N-end  rule  pathway:  only 
those  subunits  of  an  oligomeric  protein  that  contain  the 
ubiquitylation  site  (but  not  necessarily  a  destabilizing 
N-terminal  residue)  are  actually  degraded  (11). 

What  might  be  the  mechanism  of  subunit-specific 
proteolysis?  A  ‘simple’  model  is  suggested  by  the 
binding  of  a  substrate-linked  multi-Ub  chain  to  a 
component  of  the  proteasome.  Specifically,  a  subunit  of 
an  oligomeric  substrate  bound  to  the  proteasome 
through  a  subunit-linked  multi-Ub  chain  may  be  the 
only  subunit  that  undergoes  further  mechanochemical 
processing  by  chaperone-like,  ATP-dependent  com¬ 
ponents  of  the  26S  proteasome.  These  components 
mediate  the  unfolding  and  translocation  steps  that 
cause  a  movement  of  the  subunit  toward  active  sites  in 
the  proteasome’s  interior,  and  in  the  process  dissociate 
this  subunit  from  the  rest  of  oligomeric  substrate.  In 
this  mechanism,  the  initial  binding  of  N-recogmn 
to  another  subunit — that  which  bears  a  destabilizing 
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N-terminal  residue  but  not  the  lysine  determinant 
(Fig.  2C) — may  be  either  too  transient  (lasting,  in  a 
‘productive’  engagement,  only  long  enough  for  a  lysine 
to  be  captured  on  a  nearby  subunit)  or  sterically 
unfavourable  for  the  delivery  of  this  subunit  to  the 
interior  of  the  proteasome. 

Since  other  Ub-dependent  degradation  signals  appear 
to  be  organized  similarly  to  the  N-degron  (a  ‘primary’ 
recognition  determinant  plus  an  internal  lysine  or 
lysines),  subunit  selectivity  is  likely  to  be  a  general 
feature  of  proteolysis  by  the  Ub  system  (Varshavsky 
1996a)-  Examples  of  physiologically  relevant  subunit- 
selective  proteolysis  include  the  degradation  of  p53  in  a 
complex  "with  the  papilloma  viral  protein  E6  (Scheffiier 
et  al  1995)  and  the  degradation  of  a  cyclin  in  a  complex 
with  a  cyclin-dependent  kinase  (Murray  &  Hunt  1993). 

The  hairpin  insertion  model  and  the  function 
of  multiubiquitin  chain 

Formation  of  a  substrate-linked  multi-Ub  chain 
produces  an  additional  binding  site  (or  sites)  for 
components  of  the  proteasome.  The  resulting  increase 
in  affinity,  i.e.  a  decrease  in  the  rate  of  dissociation  of  the 
proteasome-substrate  complex,  can  be  used  to  facilitate 
proteolysis.  Suppose  that  a  rate-limiting  step  which 
leads,  several  steps  later,  to  the  first  proteolytic  cleavage 
of  the  proteasome-bound  substrate  is  an  unfolding 
(driven  by  thermal  fluctuations)  of  a  relevant  region  of 
the  substrate.  If  so,  an  increase  in  stability  of  the 
proteasome-substrate  complex,  brought  about  by  the 
multi-Ub  chain,  should  facilitate  the  substrate’s  degra¬ 
dation,  because  the  longer  the  allowed  ‘waiting’  time, 
the  greater  the  probability  of  a  required  unfolding 
event.  Another  (not  mutually  exclusive)  possibility  is 
that  a  substrate-linked  multi-Ub  chain  acts  as  a 
proximity  trap  for  partially  unfolded  states  of  a  substrate. 
This  might  be  achieved  through  reversible  interactions 
of  the  chain’s  Ub  moieties  with  regions  of  the  substrate 
that  undergo  local  unfolding.  A  prediction  common  to 
both  models  is  that  the  degradation  of  a  substrate  whose 
conformation  poses  less  of  a  kinetic  impediment  to  the 
proteasome  should  be  less  dependent  on  Ub  and 
ubiquitylation  than  the  degradation  of  an  otherwise 
similar  but  more  stably  folded  substrate. 

How  is  a  proteasome-bound,  ubiquitylated  protein 
directed  to  the  interior  of  the  proteasome?  This 
problem  is  analogous  to  that  in  studies  of  transmem¬ 
brane  channels  for  protein  translocation  (Simon  & 
Blobel  1991;  Schatz  &  Dobberstein  1996).  Could  the 
solutions  be  similar  in  these  systems,  reflecting,  perhaps, 
a  common  ancestry  of  translocation  channels  and 


proteasomes?  The  model  in  Fig.  2E  proposes,  by 
analogy  with  translocation  systems,  a  ‘hairpin’  insertion 
mechanism  for  the  initiation  of  proteolysis  by  the  26S 
proteasome.  A  biased  random  walk  (‘thermal  ratchet’) 
that  is  likely  to  underlie  the  translocation  of  proteins 
across  membranes  (Simon  &  Blobel  1991)  may  also  be 
responsible  for  the  movement  of  the  substrate’s  poly¬ 
peptide  chain  through  the  proteasome,  with  cleavage 
products  diffusing  out  from  the  proteasome’s  distal  end 
and  thereby  contributing  to  the  net  bias  in  the  chain’s 
bidirectional  saltations  through  the  proteasome  channel. 
One  prediction  of  the  hairpin  insertion  model  for  an 
N-end  rule  substrate  whose  N-degron’s  determinants 
are  located  upstream  of  the  hairpin  insertion  site  is  that 
the  substrate’s  N-terminal  region  (Fig.  2E)  is  likely  be 
cleaved-off*  at  later  stages  of  targeting,  and  is  therefore 
likely  to  be  spared  from  the  proteasome-mediated 
degradation. 

Two  findings  indicate  that  the  unfolding  of  a  targeted 
N-end  rule  substrate  is  a  prerequisite  for  its  degradation 
by  the  26S  proteasome.  Methotrexate — a  folic  acid 
analogue  and  high-affinity  ligand  of  DHFR  can 
inhibit  the  degradation  of  an  N-end  rule  substrate 
such  as  Arg-DHFR  by  the  N-end  rule  pathway 
(Johnston  et  al  1995).  This  result  suggests  that  a  critical 
post-ubiquitylation  step  faced  by  the  proteasome 
includes  a  ‘sufficient’  conformational  perturbation  of 
the  proteasome-bound  substrate.  Furthermore,  it  was 
shown  that  the  N-end  rule-mediated  degradation  of  a 
17kDa  N-terminal  fragment  of  the  70kDa  Sindbis 
virus  polymerase  is  not  precluded  by  the  conversion  of 
all  of  the  fragment’s  lOLys  residues  into  Arg  residues, 
which  cannot  be  ubiquitylated  (T.  Riimenapf,  J.  Strauss 
&  A.V,  unpublished  data).  Thus,  the  ubiquitylation 
requirement  of  previously  studied  N-end  rule  substrates 
may  be  a  consequence  of  their  relatively  stable 
conformations.  The  binding  of  a  largely  unfolded 
substrate  (such  as  a  fragment  of  Sindbis  polymerase)  by 
the  targeting  complex  of  the  N-end  rule  pathway  may 
be  sufficient  for  the  delivery  of  the  substrate  to  the 
proteasome’s  active  sites  in  the  absence  of  a  multi-Ub 
chain.  In  the  language  of  models  in  Fig.  2E,  F,  the 
‘waiting’  time  for  a  bound  and  conformationally 
unstable  substrate  may  be  short  enough  not  to  require 
the  formation  of  a  dissociation-slowing  device  such  as  a 
multi-Ub  chain. 

The  N-end  rule  without  ubiquitin 

No  Ub-like  covalent  modification  of  N-end  rule 
substrates  has  been  detected  in  E.  coll  in  contrast  to 
ubiquitylation  of  the  same  substrates  in  eukaryotes. 
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Moreover,  the  conversion  of  ubiquitylation-site  lysines 
of  an  N~end  rule  substrate  into  arginines  rendered  the 
substrate  long-lived  in  eukaryotes  but  did  not  impair  its 
degradation  in  E.  coli  (Tobias  et  al.  1991).  Thus,  E.  cob 
not  only  lacks  a  homologue  of  eukaryotic  Ub,  but  also 
lacks  the  requirement  for  a  lysine-specific  modification 
of  a  substrate.  Bacteria  may  contain  proteins  whose 
function  in  the  N-end  rule  pathway  is  Ub-like  but 
involves  a  noncovalent,  lysine-independent  binding  to  a 
targeted  substrate.  The  proposed  role  of  a  substrate- 
linked  multi-Ub  chain  in  ‘marking’  a  subunit  of  a 
protein  for  selective  destruction  leads  to  another  testable 
conjecture:  if  a  subunit-marking  device  is  absent  from 
the  E.  coli  N-end  rule  pathway,  the  latter  may  be 
incapable  of  degrading  an  oligomeric  protein  ‘one 
subunit  at  a  time’. 

Substrates  and  functions  of  the  N-end 
rule  pathway 

The  N-end  rule  and  osmoregulation  in  yeast 

A  synthetic  lethal  screen  was  used  to  isolate  an  S. 
cerevisiae  mutant,  termed  slnl  (for  ‘synthetic  lethal  of 
N-end  rule’),  whose  viability  requires  the  presence  of 
UBRl  (Ota  &  Varshavsky  1992).  SLNl  has  been  found 
to  encode  a  eukaryotic  homologue  of  two-component 
regulators — a  large  family  of  proteins  previously 
encountered  only  in  bacteria  (Ota  &  Varshavsky 

1993) .  The  properties  of  S.  cerevisiae  Slnlp  are  consistent 
with  it  being  a  sensor  component  of  the  osmoregulatory 
(HOG)  pathway — a  MAP  kinase  cascade  (Maeda  et  al. 

1994) .  Since  an  otherwise  lethal  hypomorphic  mutation 
in  SLNl  can  be  suppressed  by  the  presence  of  Ubrlp 
(N-recognin)  (Ota  &  Varshavsky  1993),  it  is  likely  that 
one  or  more  of  the  proteins  (e.g.,  kinases)  whose 
activity  is  down-regulated  by  Slnlp  can  also  be  down- 
regulated  through  their  degradation  by  the  N-end  rule 
pathway.  The  relevant  physiological  N-end  rule  sub- 
strate(s)  remains  to  be  identified. 

The  N-end  rule  and  the  import  of  peptides 

Alagramam  et  al.  (1995)  have  found  that  ubriA  yeast 
cells  are  unable  to  import  di-  and  tripeptides.  They  have 
also  shown  that  wfcrlA  cells,  unlike  the  congenic  JJBRt 
cells,  contain  virtually  no  PTR2  mRNA  that  encodes  a 
peptide  transporter,  an  integral  plasma  membrane 
protein.  Recent  results  (C.  Byrd  &  A.V.,  unpublished 
data)  indicated  that  the  control  of  PTR2  expression  by 
Ubrlp  (N-recognin)  involves  the  Ub-conjugating  (E2) 
enzyme  Ubc2p,  a  known  component  of  the  N-end  rule 


pathway  (Fig.  3).  The  Ubc4p  E2  enzyme  can  partially 
compensate  for  the  absence  of  Ubc2p;  deletion  of  both 
UBC2  and  UBC4  results  in  cells  that  do  not  express 
Ptr2p  and  are  unable  to  import  peptides,  similarly  to 
ubrlA  cells.  Ubc4p  has  not  been  previously  identified  as 
a  component  of  the  N-end  rule  pathway. 

The  findings  of  Alagramam  et  al.  (1995),  and  the 
observed  dependence  of  peptide  import  on  two  specific 
ubiquitin-conjugating  enzymes  can  be  accounted  for  by 
a  model  in  which  the  expression  of  the  Ptr2p 
transporter  is  regulated  by  a  short-lived  repressor  that 
is  degraded  by  the  N-end  rule  pathway.  One  prediction 
of  this  model  is  that  a  mutational  inactivation  of  the 
repressor  would  bypass  the  requirement  for  Ubrlp  in 
the  import  of  peptides.  Using  a  screen  for  such  mutants, 
we  isolated  a  gene  called  CUPP  (C.  Byrd  &  A.V., 
unpublished  data).  Its  product  is  a  homeodomain- 
containing,  short-lived  protein  whose  degradation  is 
carried  out  largely  by  the  N-end  rule  pathway.  Over¬ 
expression  of  Cup9p  inhibits  the  import  of  peptides. 
Conversely,  cupPA  cells  express  Ptr2p  and  import 
peptides  at  higher  rates  than  CUPP  cells.  Moreover, 
cupPA  cells  can  import  peptides  in  the  absence  of 
UBRly  whereas  the  import  by  CUPP  cells  requires 
UBRl.  These  findings  (C.  Byrd  &  A.V.,  unpublished 
data)  stron^y  suggest  that  Cup9p  is  the  postulated 
short-lived  repressor  which  controls  the  rate  of  peptide 
import  by  regulating  the  expression  of  the  Ptr2p 
transporter.  Remarkably,  an  earlier  study  (Knight  et  al. 
1994)  identified  Cup9p  as  a  protein  required  for  an 
aspect  of  resistance  to  copper  toxicity  in  S.  cerevisiae. 
Thus,  one  and  the  same  physiological  substrate  of  the 
N-end  rule  pathway  functions  as  both  a  repressor  of 
peptide  import  and  a  regulator  of  copper  homeostasis. 

Get  subunit  of  G  protein 

Overexpression  of  the  N-end  rule  pathway  was  found 
to  inhibit  the  growth  of  haploid  but  not  diploid  cells 
(Madura  &  Varshavsky  1994).  This  ploidy-dependent 
toxicity  was  traced  to  the  enhanced  degradation  of 
Gpalp,  the  Ga  subunit  of  the  G  protein  that  regulates 
cell  differentiation  in  response  to  mating  pheromone. 
The  half-life  of  newly  formed  Ga  at  30  is  « 50  min 
in  wild-type  cells,  ***10  min  in  cells  overexpressing  the 
N-end  rule  pathway,  and  >10  h  in  cells  lacking  the 
pathway.  The  degradation  of  Ga  is  preceded  by  its 
muldubiquitylation  (Madura  &  Varshavsky  1994).  Like 
other  Ga  subunits  of  G  proteins,  the  S.  cerevisiae  Gpalp 
bears  a  conjugated  N-terminal  myristoyl  moiety,  which 
appears  to  be  retained  on  Gpalp  during  its  targeting 
for  degradation.  A  deletion  of  the  first  88  residues 
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of  Gpalp  gready  accelerates  its  decay  but  retains 
the  dependence  of  Gpalp  degradation  on  Ubrlp 
(K.  Madura,  unpublished  data).  These  data  suggest 
that  Ubrlp  recognizes  a  feature  of  Ga  that  is  distinct 
from  the  N-degron.  Another,  N-degron-based  model 
invokes  a  trans-toigetmg  mechanism  (Fig.  2C,  D). 

Physiological  implications  of  the  Ubrlp-dependent 
degradation  of  Ga  remain  to  be  understood.  Because 
the  metabolic  stability  of  Ga  is  expected  to  be 
influenced  by  its  functional  state — Ga  can  be  GTP- 
or  GDP-bound,  covalendy  modified,  or  associated  with 
G/Sy,  the  pheromone  receptor,  and  other  Ga  ligands, 
the  degradation  of  Ga  in  yeast  may  function  either  to 
augment  or  to  inhibit  cell’s  responses  to  a  pheromone. 
A  Gj-type  Ga  is  short-lived  in  mouse  cells  as  well  (Levis 
&  Bourne  1992),  consistent  with  the  possibility  that 
Ga  subunits  of  other  organisms  are  also  degraded  by  the 
N-end  rule  pathway.  The  activation  of  mouse  Ga 
shortens  its  in  vivo  half-life  (Levis  &  Bourne  1992), 
suggesting  an  adaptation-related  function  of  Ga 
degradation.  Further,  Obin  et  al.  (1994)  described  the 
ATP-dependent  degradation  of  all  three  subunits  of  the 
bovine  retinal  G  protein  in  reticulocyte  extract.  (It  is 
not  known  whether  G/8  and/or  Gy  subunits  of  the 
S.  cerevisiae  G  protein  are  also  metabolically  unstable.) 
Hondermarck  et  al.  (1992)  (see  also  Taban  et  at.  1996) 
reported  that  differentiation  of  rat  pheochromocytoma 
PC  12  cells  is  inhibited  by  dipeptides  bearing  destabiliz¬ 
ing  N-terminal  residues.  (These  compounds  have  been 
shown  to  inhibit  the  N-end  rule  pathway  in  S.  cerevisiae 
(Baker  &  Varshavsky  1991);  their  efficacy  as  N-end  rule 
inhibitors  in  mammalian  cells  remains  to  be  evaluated.) 
Given  the  findings  with  Ga  (Madura  &  Varshavsky 
1994),  one  interpretation  of  these  results  (Hondermarck 
et  al.  1992)  is  that  inhibitors  of  the  N-end  rule  pathway 
may  suppress  cell  differentiation  through  a  metabolic 
stabilization  of  the  relevant  Ga  subunits  in  PC  12  cells. 

Sindbis  virus  RNA  Polymerase  and  other  viral 
proteins 

The  Sindbis  virus  RNA  polymerase,  also  called  nsP4 
^orntructural  Protein  4),  is  produced  by  an  endopro- 
teolytic  cleavage  of  the  viral  precursor  polyprotein 
nsP1234  (Strauss  &  Strauss  1994).  The  nsP4  protein 
bean  N-terminal  Tyr  (a  primary  destabilizing  residue; 
Figs  1  and  5A),  and  is  degraded  by  the  N-end  rule 
pathway  in  reticulocyte  extract  (deGroot  et  al.  1991). 
Tyr  is  an  N-terminal  residue  of  other  alphaviral  RNA 
polymerases  as  well  (Strauss  &  Strauss  1994),  suggesting 
that  these  homologues  of  Sindbis  polymerase  are  also 
degraded  by  the  N-end  rule  pathway.  Whereas  the  bulk 


of  newly  formed  nsP4  is  rapidly  degraded,  a  fraction  of 
nsP4  in  infected  cells  is  long-lived,  presumably  within  a 
replication  complex  that  contains  viral  and  host  proteins 
(Strauss  Strauss  1994,  and  references  therein).  This 
model  may  be  generally  applicable,  in  that  physiological 
N-end  rule  substrates — including  alphaviral  RNA 
polymerases  and  Ga  subunits  of  G  proteins — are 
likely  to  exist  in  several  states  that  differ  by  covalent 
modifications  of  a  substrate  and/or  its  associations  with 
other  ligands,  and  that  consequently  also  differ  by  the 
rates  at  which  various  forms  of  a  substrate  are  degraded 
by  the  N-end  rule  pathway. 

There  are  many  potential  N-end  rule  substrates 
derived  from  viral  polyproteins  (Dougherty  &  Sender 
1993).  One  of  them  is  the  integrase  of  the  human 
immunodeficiency  virus  (HIV),  produced  by  cleavages 
within  the  precursor  polyprotein.  The  processed 

integrase  bears  N-terminal  Phe  (Dougherty  &  Sender 
1993),  a  strongly  destabdizing  residue  in  the  N-end  rule 
(Fig.  5A).  Therefore  it  is  possible  that — simdarly  to  the 
Sindbis  virus  RNA  polymerase — at  least  a  fraction  of 
HIV  integrase  is  short-lived  in  vivo. 

c-Mos,  a  proto-oncoprotein 

This  39  kDa  Ser/Thr-kinase  is  expressed  predominantly 
in  male  and  female  germ  cells.  Sagata  and  colleagues 
have  identified  c-Mos  as  a  physiological  substrate  of  the 
N-end  rule  pathway  that  is  targeted  for  degradation 
through  its  N-terminal  Pro  residue  (Nishizawa  et  al. 
1992,  1993).  Met-Pro-Ser-Pro,  the  encoded  N-term- 
inal  sequence  of  Xenopus  c-Mos,  is  conserved  among  all 
vertebrates  examined  (Nishizawa  et  al.  1992),  Since  the 
N-terminal  Met-Pro  peptide  bond  is  readdy  cleaved  by 
the  major  cytosolic  Met-aminopeptidases  (Arfin  & 
Bradshaw  1988),  the  initially  second-position  Pro  is 
expected  to  appear  at  the  N-terminus  of  nascent  c-Mos 
cotranslationaUy  or  nearly  so. 

The  activity  of  the  Pro-based  N-degron  in  c-Mos  is 
inhibited  through  the  phosphorylation  of  Ser-2  (Ser-3 
in  the  c-Mos  ORF)  (Nishizawa  et  al.  1992,  1993). 
During  the  maturation  of  Xenopus  oocytes,  c-Mos  is 
phosphorylated  partially  and  reversibly,  and  therefore 
remains  short-lived.  Later,  at  the  time  of  germinal 
vesicle  breakdown  and  the  arrest  of  mature  oocytes 
(eggs)  at  the  second  meiotic  metaphase,  c-Mos  becomes 
long-lived,  owing  to  its  nearly  stoichiometric  phos¬ 
phorylation  at  Ser-2  (Watanabe  et  al.  1991).  Fertiliza¬ 
tion  or  mechanical  activation  of  a  Xenopus  egg  releases 
the  meiotic  arrest  through  the  induced  degradation  of 
c-Mos— caused  by  a  nearly  complete  dephosphoryla¬ 
tion  of  phosphoserine-2  (Nishizawa  et  al.  1992,  1993). 
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Consistent  with  this  model  of  the  N-degron  in  c-Mos, 
the  replacement  of  Ser--2  with  Asp  or  Glu  (whose 
negative  charge  mimics  that  of  the  phosphoryl  group) 
rendered  c-Mos  long-hved,  whereas  the  replacement 
of  Ser-2  with  Ala  yielded  a  constitutively  unstable 
c-Mos  (Nishizawa  et  al  1992).  Lys-33  (Lys-34  in  the 
c-Mos  ORF)  is  a  major  ubiquitylation  site  of  the  c-Mos 
N-degron  (Nishizawa  er  al  1993). 

In  contrast  to  N-terminal  Pro  in  the  context  of  c-Mos, 
the  N-terminal  Pro  followed  by  the  sequence  His-Gly 
Ser-. . .  (this  is  the  context  of  engineered  N-end  rule 
substrates  such  as  X-|8gal  and  X-DHFR  (Varshavsky 
1992))  did  not  confer  a  short  half-life  on  a  reporter 
protein  in  either  yeast  or  mammalian  cells  (F.  Levy, 
T.  Riimenapf  &  A.V,  unpublished  data).  One  interpre¬ 
tation  of these  results  is  that  the  N-degron  of  c-Mos,  whose 
conserved  N-terminal  sequence  is  Pro-Ser-Pro-. . has  a 
‘degron-enabling’  internal  determinant  additional  to, 
and  perhaps  specific  for,  the  N-terminal  Pro,  The  c-Mos 
N-degron  is  the  first  example  of  N-degron  whose 
activity  is  regulated  by  phosphorylation  (Nishizawa  et  al. 
1992). 

Compartmentalized  proteins  retrotransported 
to  the  cytosol 

In  contrast  to  cytosolic  and  nuclear  proteins,  the 
proteins  that  function  in  (or  pass  through)  the  ER, 
Golgi,  and  related  compartments  often  bear  destabiliz¬ 
ing  N-terminal  residues — the  consequence  of  cleavage 
specificity  of  signal  peptidases,  which  remove  signal 
sequences  from  proteins  translocated  into  the  ER 
(Bachmair  et  al  1986).  We  have  suggested  that  one 
function  of  the  N-end  rule  pathway  may  be  the 
degradation  of  previously  compartmentalized  proteins 
that  ‘leak’  or  are  transported  into  the  cytosol  from 
compartments  such  as  ER  (Bachmair  et  al  1986; 
Varshavsky  1992).  Remarkably,  it  has  recendy  been 
found  that  at  least  some  compartmentalized  proteins  can 
be  retrotransported  to  the  cytosol  through  a  route  that 
requires  specific  ER  proteins,  USll,  the  ER-resident 
transmembrane  protein  encoded  by  cytomegalovirus, 
causes  the  newly  translocated  MHC  class  I  heavy  chain 
to  be  selectively  retrotransported  back  to  the  cytosol, 
where  the  heavy  chain  is  degraded  by  a  proteasome- 
dependent  pathway  (Wiertz  et  al  1996).  Similarly, 
CPY*,  a  defective  vacuolar  carboxypeptidase  of 
S.  cerevisiae,  is  retrotransported  to  the  cytosol  shordy 
after  entering  the  ER,  and  is  degraded  in  the  cytosol  by 
a  Ub/proteasome-dependent  pathway  that  requires  the 
UbcTp  Ub-conjugating  enzyme  (Hiller  et  al  1996). 
The  expected  N-terminal  residue  of  the  translocated 


and  processed  MHC  class  I  heavy  chain  is  Gly — a 
stabilizing  residue  (Fig.  1).  The  expected  N-terminal 
residue  of  the  wild-type  CPY  carboxypeptidase  whose 
signal  sequence  had  been  cleaved  off  is  He — a  primary 
destabilizing  residue  (Fig.  1).  Whether  the  N-end 
rule  pathway  plays  a  role  in  the  degradation  of 
retrotransported  proteins  remains  to  be  determined. 

Potential  N-end  rule  substrates 

Several  cytosolic  and  nuclear  proteins  are  known  to  bear 
destabilizing  N-terminal  residues,  but  have  not  been 
shown,  thus  far,  to  be  degraded  by  the  N-end  rule 
pathway.  Among  them  are  the  X  phage  cll  protein,  the 
S.  cerevisiae  Cup  Ip  protein,  the  catalytic  subunits  of 
calpains  (calcium-dependent  proteases),  and  several 
histone-like,  micronucleus-specific  proteins  of  Tetra- 
hymena.  These  putative  N-end  rule  substrates  are 
discussed  by  Varshavsky  et  al.  (1997). 

Applications  of  N-degron 

The  portability  and  modular  organization  of  N-degrons 
make  possible  a  variety  of  apphcations  whose  common 
feamre  is  the  conferring  of  a  constitutive  or  conditional 
metabolic  instability  on  a  protein  of  interest.  These 
applications  are  discussed  elsewhere  (Varshavsky  1995, 
1996a, b). 

Concluding  remarks 

Although  many  things  have  been  learnt  about  the  N-end 
rule  since  its  discovery  10  years  ago,  the  answers  to 
several  key  questions  remain  unknovm.  For  example, 
the  detailed  mechanics  of  targeting  is  not  understood. 
The  biochemical  dissection  of  the  N-end  rule  pathway 
reconstituted  in  vitro  from  defined  (cloned)  components 
will  be  essential  for  attaining  this  goal.  Crystallographic- 
quality  structural  information  about  N-recognin  and 
the  entire  targeting  complex  will  be  required  as  well. 
The  recently  emerged  possibility  that  N-recognin  may 
target  not  only  N-degrons  but  also  other  degradation 
signals  adds  yet  another  level  of  complexity  which  will 
have  to  be  addressed. 

Genetic  screens  for  proteins  degraded  by  the  N-end 
rule  pathway  are  our  best  hope  for  bringing  to  light 
physiological  N-end  rule  substrates.  It  is  already  clear 
that  at  least  some  of  these  substrates  are  conditionally 
unstable — for  example,  partitioned  between  a  short¬ 
lived  free  substrate  and  a  long-lived  complex  of  the 
substrate  with  other  proteins.  In  addition,  for  some 
substrates,  the  rate-limiting  step  in  their  degradation 
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may  be  a  processing  (cleavage)  event  that  produces  an 
N~degron  from  a  pre-N-degron.  If  so,  a  significant 
fraction  of  extant  substrate  molecules  may  bear  a 
stabilizing  N-terminal  residue.  Given  these  obstacles 
to  identifying  physiological  N-end  rule  substrates,  they 
are  likely  to  be  more  numerous  than  is  apparent  at  the 
present  time. 
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The  ubiquitin  system 


Alexander  Varshavsky 

Eukaryotes  contain  a  highly  conserved  multi-enzyme  system  that  covalently 
links  ubiquitin  to  a  variety  of  intracellular  proteins  that  bear  degradation 
signals  recognized  by  this  system.  The  resulting  ubiquitin-protein  conjugates 
are  degraded  by  the  26S  proteasome,  a  large  ATP-dependent  protease. 
Pathways  that  involve  ubiquitin  underlie  a  multitude  of  processes,  including 
cell  differentiation,  the  cell  cycle  and  responses  to  stress. 


UBIQUITIN  (Ub)  is  a  76-residue  pro¬ 
tein  that  exists  in  cells  either  free  or  co¬ 
valently  linked  to  other  proteins.  Ub- 
dependent  pathways  have  been  shown 
to  play  major  roles  in  a  legion  of  biologi¬ 
cal  processes,  including  cell  differenti¬ 
ation,  the  cell  cycle,  embryogenesis, 
apoptosis,  signal  transduction,  DNA 
repair,  transmembrane  and  vesicular 
transport,  stress  responses  (including 
the  immune  response)  and  functions 
of  the  nervous  system.  Many,  but  not 
all,  of  the  UlMlependent  pathways 
involve  processive  degradation  of  Ub- 
conjugated  (ubiquitylated*)  proteins  by 
the  26S  proteasome  -  an  ATP-dependent, 
multisubunit  protease.  1  will  remark  on 
the  history  of  this  field,  consider  the 
enzymology  and  mechanics  of  the  Ub 
system,  and  describe  some  of  its  degra¬ 
dation  signals  and  physiological  func¬ 
tions.  A  review  of  this  length  can  offer 
but  a  glimpse  of  the  Ub  system,  mak¬ 
ing  pau'tial  amends  with  references  to 
more  detailed  reviews  and  recent  origi¬ 
nal  works. 

The  beginnings 

Ub  was  first  described  in  1975  as  an 
abundant,  highly  conserved  protein, 
hence  its  name^  In  1978-1985,  Hershko 
and  co-workers  identified  Ub  as  an  essen¬ 
tial  component  of  the  ATP-dependent 
proteolytic  system  in  an  extract  from 
rabbit  reticulocytes,  and  deciphered 
in  vitro  the  E1-E2-E3  enzymology  of  Ub 
conjugation^.  Having  been  attained  in  the 
setting  of  a  cell-free  system,  these  impor¬ 
tant  early  advances  did  not  reveal  the 
functions  of  the  ubiquitin  system.  They 
also  did  not  address  the  nature  of  a  degra¬ 
dation  signal  in  a  short-lived  protein. 
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In  1984,  we  identified  the  mouse  cell 
line  ts85  as  a  temperature-sensitive  mu¬ 
tant  in  the  Ub-activating  enzyme^,  and 
produced  the  first  evidence  that  Ub  is  re 
quired  for  protein  degradation  in  living 
cells.  Ub-dependent  proteolysis  was  found 
to  be  essential  for  cell-cycle  progression, 
as  ts85  cells  were  arrested  specifically  in 
the  G2  phase  of  the  cell  cycle^.  This  in¬ 
sight  presaged  the  understanding  of  the 
role  of  cyclins  and  cyclin-like  proteins 
whose  periodic  Ub-dependent  destruc¬ 
tion  drives  and  regulates  the  cell  cycle^  l 

In  1986,  we  discovered  the  first  degra¬ 
dation  signal  of  the  Ub  system.  The 
in  vivo  half-life  of  a  protein  was  found  to 
depend  on  the  identity  of  the  protein’s 
N-terminal  residue  -  a  relation  termed 
the  N-end  rule^l 

In  1987,  following  the  initial  insight 
with  ts85  cells^,  we  discovered  the  first 
physiological  functions  of  the  Ub  system, 
by  demonstrating  that  RAD6  and  CDC34 
-  key  components  of  DNA  repair  and  cell 
cycle  control  -  are  in  fact  Ub<onjugating 
enzymes®’^  A  non-proteolytic,  chaperonin 
function  of  Ub  was  also  identified  -  by  the 
finding  that  a  transient  linkage  of  Ub  to 
nascent  ribosomal  proteins  is  required  for 
the  efficient  biogenesis  of  ribosomes^®. 

In  1989,  we  discovered  the  multi-Ub 
chain^\  an  advance  that  initiated  the 
understanding  of  the  mechanistic  role  of 
Ub,  It  was  also  found  that  Ub-dependent 
proteolysis  can  destroy  a  subunit  of  an 
oligomeric  protein  selectively,  leaving  in¬ 
tact  the  rest  of  the  protein’s  subunits*^. 
This  fundamental  capability  of  the  Ub 
system  accounts  for  large  differences  in 
the  m  vivo  half-lives  of  subunits  in  many 
regulatory  proteins. 

Another  advance,  which  concluded 
the  early  era  of  Ub  studies,  was  the 
demonstration  by  several  laboratories 
that  a  -2000  kDa,  ATP-dependent  pro¬ 
tease,  termed  the  26S  proteasome, 
specifically  degrades  Ub-protein  conju¬ 


gates^*^^*^  The  26S  proteasome  consists 
of  the  previously  identified  20S  ‘core’ 
proteasome^®"**  and  a  complex  contain¬ 
ing  multiple  ATPases  (the  19S  ‘cap’)  at 

both  ends  of  the  20S  proteasome*^2i 

Organization  and  enzymology  of  the 
ubiquitin  system 

Eukaryotic  cells  contain  Ub-specific 
enzymes  that  catalyse  reactions  whose 
product  is  either  a  single  Ub  moiety  or  a 
multi-Ub  chain  covalently  linked  to  an 
acceptor  protein  (Fig.  1).  Ub  is  conjugated 
to  other  proteins  through  an  amide  bond, 
called  the  isopeptide  bond,  between  the 
C-terminal  (Gly76)  residue  of  Ub  and 
the  €-amino  group  of  a  Lys  residue  in  an 
acceptor  protein^ 

Ub  is  activated  for  conjugation  to  other 
proteins  by  a  Ub-activating  enzyme  (El), 
which  couples  ATP  hydrolysis  to  the 
formation  of  a  high-energy  thioester  bond 
between  Gly76  of  Ub  and  a  specific  Cys 
residue  of  El  (Fig.  l)2-2*.23  El-linked 
Ub  moiety  is  moved,  in  a  transesterifi¬ 
cation  reaction,  from  El  to  a  Cys  residue 
of  a  Ub^onjugating  enzyme  (E2),  and 
from  there  to  a  Lys  residue  of  an  ultimate 
acceptor  protein,  yielding  a  Ub-protein 
conjugate  (Fig.  1).  This  last  step  requires 
the  participation  of  another  component, 
called  E3  or  recognin^-^*’^^  which  selects 
a  protein  for  ubiquitylation  through  an 
interaction  with  its  degradation  signal. 
At  least  three  of  the  known  E3  proteins  - 
the  mammalian  E6AP,  its  yeast  (Schizo- 
saccharomyces  pombe)  homolog  Pub  Ip, 
and  Ubrlp,  the  E3  of  the  N-end  rule  path¬ 
way  -  are  also  bona  fide  enzymes  that 
act  at  the  step  between  an  E2  and  an  ulti¬ 
mate  acceptor  of  Ub.  These  (and  poss¬ 
ibly  other)  E3s  catalyse  in  particular  the 
movement,  through  transesterification, 
of  the  Ub  moiety  from  the  Cys  residue  of 
a  relevant  (E3-associated)  E2  enzyme  to 
a  Cys  residue  of  E3  itself  -  in  a  reaction 
analogous  to  the  Ub  transfer  from  El  to 
E2  (Fig.  1)  (see  Refs  24,  25;  V.  Chau  and 
A.  Varshavsky,  unpublished). 

MultHJbiquitin  chains.  In  these  struc¬ 
tures,  the  C-terminal  Gly  of  one  Ub  is 
joined  to  an  internal  Lys  of  the  adjacent 
Ub  moiety,  resulting  in  a  chain  of  Ub-Ub 
conjugates  containing  two  or  more  Ub 
moieties**-^®.  Ub  has  seven  lysines;  thus, 

•Ubiquitin  whose  C-terminal  {Gly76)  carboxyl  group 
is  covalently  linked  to  another  compound  is  called 
the  ubiquity!  moiety,  the  derivative  terms  being 
ubiquitylation  and  ubiquitylated  The  term  Ub  refers 
to  both  free  ubiquitin  and  the  ubiquity!  moiety.  This 
nomenclature®^  which  is  also  recommended  by  the 
Nomenclature  Committee  of  the  International  Union 
of  Biochemistry  and  Molecular  Biology®®,  brings 
ubiquitin-related  terms  in  line  with  the  standard 
chemical  terminology. 
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Rgure  1 

The  ubiquitin  system  of  Saccharomyces  cerevisiae.  The  yeast  ubiquitin  (Ub)  genes,  two  of 
which  {UBIl  and  UBI2j  contain  introns,  encode  fusions  of  Ub  (yellow  rectangles)  to  Itself  ((/S/4) 
or  to  one  of  the  two  specific  ribosomal  proteins  {UBIl-UBI^  (red  and  blue  rectangles)^^  ®®. 
These  fusions  are  cleaved  by  Ub-specific  processing  proteases^^  yielding  mature  Ub. 
Thioester  bonds  between  Ub  and  the  active-site  Cys  residues  of  Ub-specific  enzymes  are 
denoted  by  The  conjugation  of  Ub  to  other  proteins  involves  a  preliminary  ATP-dependent 
step,  in  which  the  last  (Gly76)  residue  of  Ub  is  joined,  via  a  thioester  bond,  to  a  Cys  residue 
in  the  Ub-activating  (El)  enzyme  encoded  by  the  UBAl  gene^°.  The  activated  Ub  is  trans¬ 
ferred  to  a  Cys  residue  in  one  of  at  least  13  distinct  Ub-conjugating  (E2)  enzymes  encoded 
by  the  genes  UBCI-UBCIB  (Refs  8, 21,  23),  and  from  there  to  a  Lys  residue  of  an  ultimate 
acceptor  protein  (yellow  oval).  This  last  step,  and  apparently  also  the  formation  of  a  multl- 
Ub  chain  (black  ovals),  requires  the  participation  of  another  component,  called  E3  or 
recognln^-^^.  The  function  of  E3  Includes,  but  is  not  limited  to,  the  recognition  of  a  degra¬ 
dation  signal  in  the  acceptor  protein  (see  the  main  text).  The  names  of  some  of  the  cur¬ 
rently  known  yeast  E3s  (Ubrlp,  RspBp,  Ufd4p  and  Tomlp)^!  are  Indicated  as  well.  ‘Rad6' 
and  ‘Cdc34’  refer  to  the  alternative  (earlier)  names  of  the  Ubc2p  and  Ubc3p  E2  enzymes^^ 
A  targeted,  ubiquitylated  protein  is  processively  degraded  to  short  peptides  by  the  ATP- 
dependent  26S  proteasome  (see  the  main  text). 


a  multi-Ub  chain  in  which  no  Ub  moiety 
has  more  than  one  Lys  residue  linked  to 
another  Ub  moiety  can  have,  a  priori, 
any  of  the  seven  ‘pure’  topologies  or  a 
far  greater  number  of  ‘mixed*  topolo¬ 


gies.  The  first  multi-Ub  chain  to  be  dis¬ 
covered  had  its  Ub  moieties  conjugated 
through  the  Lys48  residue  of  Ub’»". 
Other  multi-Ub  chains  involve  Lys63  or 
Lys29  of  Ub^^"^^  A  chain  linked  through 


the  Lys63  residues  of  its  Ub  moieties 
appears  to  have  a  distinct  role  in  a  path¬ 
way  of  DNA  repair^T  As  discussed  below, 
one  function  of  a  substrate-linked  multi- 
Ub  chain  is  to  facilitate  the  substrate’s 
degradation  by  the  26S  proteasome. 

De-ubiquitylation.  The  covalent  bond 
between  Ub  and  other  proteins  can 
be  cleaved:  there  are  multiple,  ATP- 
independent  proteases  (Saccharomyces 
cerevisiae  has  at  least  20  of  them)  whose 
common  property  is  the  ability  to  recog¬ 
nize  a  Ub  moiety  and  cleave  at  the  Ub- 
adduct  junction^^  ^®^^  One  cause  of  the 
striking  multiplicity  of  Ub-specific  pro¬ 
cessing  proteases  (UBPs)  is  the  diversity 
of  their  targets,  which  include  linear 
(DNA-encoded)  Ub  fusions Ub  adducts 
with  small  nucleophiles  such  as  gluta- 
thione^^  and  also  free^^  or  substrate- 
linked  multi-Ub  chains”*^'"^^ 

The  junctions  in  linear  Ub  adducts 
(natural  or  engineered  Ub  fusions)  are 
structurally  distinct  from  the  junctions 
in  branched  Ub  conjugates.  Thus,  some 
of  the  UBPs  may  have  preferences  for 
either  linear  or  branched  Ub  adducts.  In 
addition,  a  Ub  conjugate  can  be  spatially 
confined  in  a  cell,  making  it  accessible 
only  to  some  UBPs.  For  example,  a  UBP 
associated  with  the  26S  proteasome  may 
have  privileged  access  to  the  muIti-Ub 
chain  of  a  proteasome-bound  substrate. 
Recent  evidence  suggests  that  this  is  in¬ 
deed  the  case^^\  and  that  proteasome- 
associated  UBPs  can  function  as  editing 
enzymes,  whose  ability  to  de-ubiquitylate 
an  already  targeted  (proteasome-bound) 
Ub-substrate  conjugate  can  modulate  the 
rate  of  degradation  of  the  substrate^. 

A  UBP  may  also  bind  specifically  to  a 
protein  that  bears  a  degradation  signal. 
In  the  absence  of  this  UBP,  the  protein  is 
ubiquitylated  and  degraded,  whereas  the 
presence  of  the  protein-bound  UBP  pre¬ 
vents  the  protein’s  degradation  -  by 
counteracting  the  formation  of  a  protein- 
linked  multi-Ub  chain.  Potential  examples 
of  this  UBP  function  in  physiologically 
relevant  settings  have  recently  been  dis¬ 
covered^^,  bringing  to  light  yet  another 
way  in  which  the  Ub  system  regulates  the 
destruction  of  specific  proteins. 

Degradation  signals  of  the  Ub  system 

Degradation  signals  are  features  of  pro¬ 
teins  that  confer  metabolic  instability. 
The  number  of  distinct  Ub-dependent 
degradation  signals  (defined  as  the 
number  of  different,  functionally  non- 
redundant  targeting  complexes)  in,  for 
example,  S.  cerevisiae,  is  unknown,  but 
is  likely  to  exceed  ten.  Some  of  these  sig¬ 
nals  are  understood  much  better  than 
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•the  others,  and  new  signals  are  certain  to 
be  discovered.  Degradation  signals  can 
be  active  constitutively  or  conditionally. 
Signals  of  the  latter  class  -  found  in  many 
regulators,  including  cyclins  and  tran¬ 
scription  factors  -  are  controlled  through 
phosphorylation  or  interactions  with 
other  proteins,  whose  binding  may  steri- 
cally  shield  an  otherwise  constitutive 
degradation  signal^’^-^*-^^.  Alternatively,  a 
protein,  by  forming  a  complex  with 
another  protein,  may  confer  a  short 
in  vivo  half-life  on  the  latter  -  through  the 
recruitment  of  a  specific  targeting  com¬ 
plex  of  the  Ub  system.  One  physiologi¬ 
cally  relevant  example  of  this  mechanism 
is  the  enhancement  of  instability  of  the 
short-lived  p53  transcriptional  regulator 
(and  tumor  suppressor)  by  the  papilloma 
virus  E6  protein,  whose  binding  to  p53 
recruits  a  targeting  complex  containing 
in  particular  the  E6AP  E3  protein^^*^. 

The  N-degron.  This  degradation  signal 
was  discovered  through  the  invention  of 
the  Ub  fusion  technique,  which  made  it 
possible  to  produce  any  desired  residue 
at  the  N-terminus  of  a  test  protein  in  a 
cell®.  The  in  vivo  half-life  of  a  protein 
was  found  to  depend  on  the  identity  of 
its  N-terminal  residue  -  a  relation  termed 
the  N-end  rule®-^.  In  eukaryotes,  the 
N-end  rule  pathway  is  a  part  of  the  Ub 
system.  However,  a  version  of  this  path¬ 
way  is  also  present  in  bacteria  such  as 
Escherichia  coli,  which  lack  the  Ub  sys- 
teml  The  N-degron,  a  signal  recognized 
by  the  N-end  rule  pathway,  comprises 
two  essential  determinants:  a  destabiliz¬ 
ing  N-terminal  residue  and  one  or  more 
internal  lysines  of  a  substrate'^*^^.  The 
Lys  residue  is  the  site  of  formation  of  a 
multi-Ub  chain".  The  N-end  rule  path¬ 
way  is  reviewed  in  Ref.  7. 

Subsequently  identified  degradation 
signals  appear  to  be  organized  similarly 
to  the  N-degron,  in  that  they  comprise 
two  distinct  elements  -  an  amino  acid 
sequence  or  a  conformational  determi¬ 
nant  (analogous  to  a  destabilizing  N- 
terminal  residue)  and  a  Lys  residue  or 
residues,  the  latter  being  the  site  of 
ubiquitylation. 

N-terminal  ubiquitin  as  a  degradation  signal. 
Protein-linked  multi-Ub  chains  function 
as  secondary  signals  for  degradation,  in 
that  post-translational  conjugation  of  Ub 
to  a  substrate  is  mediated  by  the  sub¬ 
strate’s  amino  acid  sequences  that  act 
as  a  primary  degradation  signal.  An  ex¬ 
ception  to  this  arrangement  is  a  linear 
(DNA-encoded)  Ub  fusion  bearing  a  non¬ 
removable  N-terminal  Ub  moiety;  a  pro¬ 
tein  thus  designed  is  short-lived  in 
5.  cerevisiae,  its  Ub  functioning  as  a  pri¬ 


mary  degradation  signal^^.  The  normally 
rapid  cleavage  of  linear  Ub  fusions  at  the 
Ub-protein  junction  can  be  abolished 
(making  the  Ub  moiety  ‘nonremovable’) 
by  converting  the  last  (Gly)  residue  of  Ub 
into  Val^^.  Such  Ub  fusions  are  degraded 
by  the  Ub/proteasome<lependent  system 
termed  the  UFD  pathway  (ubiquitin/ 
fusion/degradation)^^,  which  is  distinct 
from  the  N-end  rule  pathway^^'^®. 

What  are  the  physiological  UFD  sub¬ 
strates?  Eukaryotes  contain  a  number  of 
genes  encoding  proteins  that  bear  do¬ 
mains  highly  similar  to  Ub^^  Recently, 
one  such  protein,  Rad23p  of  5.  cere- 
visiae^\  was  found  to  be  conditionally 
short-lived,  its  degradation  being  medi¬ 
ated  by  the  UFD  pathway  and  the  Ub-like 
N-terminal  domain  of  Rad23p  (K.  Madura, 
pers.  commun.).  Because  this  domain  is 
required  for  the  Rad23p  function  of  re¬ 
pairing  UV-damaged  DNA^^  it  is  likely 
that  the  metabolic  instability  of  Rad23p 
is  a  significant  aspect  of  its  function.  This 
first  example  of  a  physiological  UFD  sub¬ 
strate  suggests  that  Ub-like  N-terminal 
domains  of  other  proteins  may  also  act  as 
degradation  signals,  and  that  metabolic 
instability  is  relevant  to  the  functions  of 
proteins  bearing  Ub-like  domains. 

Other  degradation  signals.  One  of  the 
better-understood  signals  resides  in  the 
transcriptional  activator  Gcn4p,  which 
induces  5.  cerevisiae  genes  whose  prod¬ 
ucts  mediate  the  synthesis  of  amino 
acids  and  purines.  Gcn4p  is  a  short-lived 
protein  (tj^  «  5  min)  whose  degradation 
requires  the  Cdc34p  Ub-conjugating  (E2) 
enzyme^^.  The  sequence  SSSTDSTP  (in 
single-letter  abbreviations)  at  positions 
99-106  of  the  281-residue  Gcn4p  encom¬ 
passes  a  major  part  of  its  degradation 
signal.  (Conversions  of  certain  residues 
in  this  sequence  into  alanines  signifi¬ 
cantly  stabilized  Gcn4p;  a  deletion  of 
the  entire  eight-residue  region  had  the 
strongest  effect,  increasing  the  tj^2 
Gcn4p  by  approximately  tenfold^^.) 

Sequences  rich  in  Pro,  Glu,  Ser  and 
Thr,  referred  to  as  PEST,  have  been  pro¬ 
posed  to  function  as  degradation  sig- 
nals^^.  PEST-like  sequences  were  indeed 
implicated  in  the  Cdc34p-dependent 
degradation  of  Gcn4p  and  a  cyclin  Cln3p 
(Refs  42, 44).  At  the  same  time,  the  PEST 
motifs  as  currently  formulated  are  pre¬ 
sent  in  roughly  one-third  of  the  known 
open  reading  frames®.  To  serve  as  useful 
prognosticators  of  degradation  signals, 
these  motifs  will  have  to  be  defined 
more  selectively. 

No  single  Lys  residue  in  the  vicinity  of 
the  99-106  sequence  was  uniquely  re¬ 
quired  for  the  Gcn4p  degradation,  but 


converting  five  or  more  of  these  lysines 
into  arginines  (which  cannot  be  ubiquit- 
ylated^O  cause  a  significant  stabiliz¬ 
ation  of  Gcn4p  (Ref.  42).  This  result  is 
consistent  with  the  previously  proposed 
‘stochastic-capture’  mechanism,  in  which 
a  reversible  binding  of  E3  to  a  non-Lys 
determinant  of  a  degradation  signal  initi¬ 
ates  a  time-restricted  search  by  the 
E3-E2  complex  for  a  sterically  accessible 
Lys  residue'*^-^.  An  E3-E2  complex  that 
recognizes  the  degron  of  Gcn4p  contains 
in  particular  Cdc53p,  which  is  bound  to 
the  Cdc34p  E2  enzyme  and  is  required 
for  the  degradation  of  5.  cerevisiae  G1 
cyclins  and  cyclin-like  proteins,  such  as 
Cln2p  (Ref.  45)  and  Sic  Ip.  Homologs  of 
Cdc53p  were  identified  in  all  eukaryotes 
examined,  suggesting  that  a  complex 
containing  Cdc53p,  Cdc34p  and  other 
proteins^®'*®  targets  phylogenetically 
conserved  sequence  motifs  present  in 
many  short-lived  proteins  and  exempli¬ 
fied  by  the  degradation  signal  of  Gcn4p. 

Mata2,  the  S.  cerevisiae  cell-type  spe¬ 
cific  transcriptional  repressor,  has  two 
distinct  degradation  signals^^  at  least 
one  of  which  has  the  unusual  property 
of  requiring  two  E2  enzymes,  Ubc6p  and 
Ubc7p,  for  its  activity^^  A  mammalian 
transcription  factor  c-Jun  is  a  short-lived 
protein  whose  Ub-dependent  degradation 
signal  is  located  in  a  region  called  5- 
domain^*’®^.  The  degradation  of  c-Jun  is 
inhibited  upon  its  functional  activation 
by  the  mitogen-activated  protein  ki¬ 
nase  (MARK),  which  phosphorylates  Ser 
and  Thr  residues  in  the  vicinity  of  the 
8-domain^®.  NF-kB,  another  mammalian 
transcription  factor,  is  regulated  in  part 
through  an  association  with  iKBa,  a  con¬ 
ditionally  short-lived  subunit  of  the  func¬ 
tionally  inactive,  cytosolic  complex  con¬ 
taining  NF-kB^^  '*^®®.  Phosphorylation  of 
IkBc,  in  response  to  a  variety  of  stresses, 
activates  its  degradation  signal,  resulting 
in  the  Ub-dependent,  subunit-selective 
degradation  of  kBa  that  releases  the 
previously  cytosolic  NF-kB  for  translo¬ 
cation  into  the  nucleus.  Another  form  of 
inactive  (cytosolic)  NF-kB  contains  an 
inhibitory  105-kDa  subunit  whose  Ub- 
dependent  partial  degradation  yields 
the  p50  subunit  and  results  in  the  active 
(nuclear)  form  of  NF-kB.  Thus,  the  Ub 
system  is  capable  of  destroying,  proces- 
sively,  a  specific  region  of  a  polypeptide 
while  sparing  the  rest  of  it®’.  (This  sim¬ 
plified  description  of  the  NF-kB  system 
belies  its  great  complexity^^*®®.) 

Folding  vs  degradation.  Most  (but  not  all) 
of  the  damaged  or  otherwise  abnormal 
proteins  are  recognized  and  destroyed  by 
the  Ub  system  -  apparently  through  the 
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axposure  of  their  normally  buried  degra¬ 
dation  signals.  This  raises  the  possibility 
that  a  normal  and  otherwise  long-lived 
but  nascent  (and  therefore  conformation- 
ally  immature)  protein  may  be  ‘regarded’ 
by  the  Ub  system  as  damaged  until  the 
protein  assumes  its  final  conformation. 
Thus,  the  folding  of  at  least  some  newly 
formed  proteins  is  expected  to  be  in  ki¬ 
netic  competition  with  pathways  that 
target  them  for  degradation.  Chaperonins, 
including  members  of  the  Hsp70p  fam¬ 
ily,  play  major  roles  not  only  in  the  fold¬ 
ing  of  newly  made  proteins  but  also  in 
the  competing  process  of  their  degrada¬ 
tion  by  the  Ub  system.  Understanding  of 
the  interplay  between  the  folding  and 
proteolytic  pathways  has  markedly  ad¬ 
vanced  in  recent  years^*"^. 

Mechanism  of  ubiquitiiHiependent  proteolysis 

Why  must  a  multi-Ub  chain  be  linked  to 
a  short-lived  protein  before  its  destruc¬ 
tion  by  the  26S  proteasome?  We  proposed 
that  a  major  function  of  a  substrate-linked 
multi-Ub  chain  is  to  decrease  the  rate  of 
dissociation  of  a  substrate-proteasome 
complex,  thereby  increasing  the  proba¬ 
bility  of  substrate  degradation**-^.  Sup¬ 
pose  that  a  rate-limiting  step,  which 
leads  to  the  first  proteolytic  cleavage  of 
the  proteasome-bound  substrate,  is  the 
unfolding  of  a  relevant  region  of  the  sub¬ 
strate  -  driven  by  thermal  fluctuations 
and/or  ATPases  of  the  26S  proteasome. 
If  so,  an  increase  in  stability  of  the 
proteasome-substrate  complex,  brought 
about  by  the  multi-Ub  chain,  should  facili¬ 
tate  substrate  degradation,  because  the 
longer  the  allowed  ‘waiting’  time,  the 
greater  the  probability  of  a  required 
unfolding  event. 

Identification  of  proteasome  subunits 
that  bind  multi-Ub  chains“-“  is  con¬ 
sistent  with  this  model.  Furthermore,  a 
17-kDa,  lysine-lacking  (and  therefore  non- 
ubiquitylatable)  fragment  of  the  70-kDa 
Sindbis  virus  polymerase  is  still  degraded 
by  the  N-end  rule  pathway  (T.  Rumenapf, 
J.  Strauss  and  A  Varshavsky,  unpub¬ 
lished).  Thus,  the  ubiquitylation  require¬ 
ment  characteristic  of  the  previously 
studied  N-end  rule  substrates  may  be  a 
consequence  of  their  relatively  stable 
conformations.  In  other  words,  the  deliv¬ 
ery  of  a  largely  unfolded  substrate  to  ac¬ 
tive  sites  in  the  interior  of  the  proteasome 
may  be  so  fast  that  the  dissociation- 
slowing  function  of  the  multi-Ub  chain  is 
no  longer  required. 

A  physiologically  relevant  example  of 
ATP-dependent,  Uihindependent  proteo¬ 
lysis  by  the  26S  proteasome  is  the  degra¬ 
dation  of  ornithine  decarbojgdase  (ODC) 
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complexed  with  antizyme,  an  ODC-binding 
protein  induced  by  polyamines^l  The 
Ub-independent  proteolysis  of  ODC  re¬ 
quires  the  presence  of  the  ODC-bound 
antizyme,  which  remains  intact  in  the 
course  of  ODC  degradation^^  The  bind¬ 
ing  of  antizyme  to  ODC  may  produce  an 
active  degradation  signal  in  ODC.  In  ad¬ 
dition,  the  antizyme  moiety  of  the  ODC- 
antizyme  complex  may  specifically  bind 
to  the  26S  proteasome,  in  which  case  it 
could  play  a  role  analogous  to  that  of  a 
substrate-linked  multi-Ub  chain. 

Physiological  functions  of  the  ubiquitin 
system 

The  effect  of  a  given  intracellular  pro¬ 
tein  on  the  rest  of  the  cell  is  determined 
in  part  by  the  concentration  of  active  pro¬ 
tein  molecules.  This  parameter,  in  turn, 
is  determined  by  the  rate  of  synthesis  of 
the  protein  in  relation  to  the  rate  of  its 
degradation,  inactivation  by  other  means 
or  export  from  the  compartment.  The 
vast  functional  range  of  the  Ub  system 
stems  from  the  diversity  of  its  physiologi¬ 
cal  substrates,  whose  number  in  a  cell  is 
likely  to  be  comparable  to  the  number 
of  substrates  of  phosphokinases.  in  other 
words,  it  is  the  constitutive  or  condi¬ 
tional  degradation  of  many  specific  pro¬ 
teins  (cyclins,  transcription  factors,  com¬ 
ponents  of  signal  transduction  pathways, 
damaged  proteins)  by  the  Ub  system 
that  underlies  the  diversity  of  its  roles, 
some  of  which  are  described  or  alluded 
to  above,  as  is  also  the  non-proteolytic, 
chaperonin  function  of  Ub*®.  Considered 
below  are  two  other  functions,  among  the 
many  that  are  already  known^*^*’^^'^’^®*^. 

Ubiquitin  as  a  signal  for  endocytosis. 
Proteins  embedded  in  the  plasma  mem¬ 
brane  are  removed  into  the  cell  interior 
through  endocytosis.  A  fraction  of  the 
removed  protein  is  eventually  returned 
to  the  plasma  membrane,  whereas  the 
rest  is  delivered  to  the  lysosome/vacuole 
and  degraded  there  by  vacuolar  pro¬ 
teases.  Hicke  and  Riezman^®  have  shown 
that  the  endocytosis  of  S.  cerevisiae 
Ste2p,  an  integral  membrane  protein  and 
receptor  for  the  a-factor  pheromone,  re¬ 
quires  ubiquitylation  of  a  specific  Lys 
residue  in  the  cytosol-exposed  C-terminal 
region  of  Ste2p.  This  lysine  is  located  in 
the  sequence  SINNDAKSS,  which  is  suffi¬ 
cient  for  receptor  internalization.  Other 
examples  of  this  new  Ub  function  have 
also  been  described®®^^  remains  to  be 
understood  why,  in  these  cases,  a  multi- 
Ub  chain  linked  to  a  cytosol-exposed  re¬ 
gion  of  a  transmembrane  protein  signals 
endocytosis  rather  than  degradation  by 
the  26S  proteasome. 


Degradation  of  compartmentalized  proteins 
retrotransported  to  the  cytosol.  It  has  been 
shown  that  proteins  translocated  from  the 
cytosol  into  the  endoplasmic  reticulum 
(ER)  may,  under  certain  conditions,  be 
retrotransported  back  to  the  cytosol, 
apparently  through  the  same  (bidirec¬ 
tional)  channel  in  the  ER  membrane®®^. 
At  least  some  of  the  retrotransported 
proteins  are  degraded  in  the  cytosol  by 
the  Ub  system,  whose  physiological 
substrates  thus  include  compartmental¬ 
ized  proteins  as  well.  A  major  class  of 
targets  for  retrotransport  are  proteins 
that  fail  to  fold  or  oligomerize  properly 
in  the  ER.  For  example,  a  defective  vac¬ 
uolar  carboxypeptidase  of  5.  cerevisiae 
is  retrotransported  to  the  cytosol  shortly 
after  entering  the  ER,  and  is  degraded 
by  a  Ub-proteasome-dependent  pathway 
that  requires  the  Ubc7p  E2  enzyme®^. 
Viruses  have  been  shown  to  take  advan¬ 
tage  of  the  retrotransport  machinery  to 
manipulate  the  immune  response  of  the 
infected  host.  Two  proteins  of  the  human 
cytomegalovirus,  USl  1  and  US2,  cause  the 
newly  translocated  class  !  heavy  chain 
of  the  major  histocompatibility  complex 
to  be  selectively  retrotransported  back 
to  the  cytosol,  where  the  heavy  chain  is 
destroyed  by  a  proteasome-dependent 
pathway®^. 

Many  compartmentalized  proteins, 
which  function  in  (or  pass  through)  the 
ER,  Golgi  and  related  compartments  bear 
N-terminal  residues  that  are  recognized 
as  destabilizing  by  the  Ub-dependent 
N-end  rule  pathway®  ®^.  In  contrast,  most 
of  the  cytosolic  and  nuclear  proteins  bear 
either  stabilizing  N-terminal  residues  or 
blocked  (acetylated)  amino  termini.  This 
difference  is  caused  in  part  by  the  cleav¬ 
age  specificity  of  signal  peptidases,  which 
remove  signal  sequences  from  proteins 
translocated  into  the  ER.  We  suggested 
that  one  function  of  the  N-end  rule  path¬ 
way  may  be  the  degradation  of  previously 
compartmentalized  proteins  that  return 
to  the  cytosol,  by  any  route,  from  com¬ 
partments  such  as  the  ER®.  Whether  the 
N-end  rule  pathway  mediates  the  degra¬ 
dation  of  at  least  some  retrotransported 
proteins  remains  to  be  determined. 

Concluding  remarks 

From  its  beginnings  in  the  1980s,  the 
field  of  Ub  studies  has  grown  enormously 
and  became  relevant  to  many  areas  of 
biology.  The  deepening  understanding 
of  the  Ub  system  is  likely  to  result, 
among  other  things,  in  powerful  experi¬ 
mental  tools  for  manipulating  the  in  vivo 
half-lives  of  intracellular  proteins  whose 
malfunction  or  overproduction  leads  to 
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-»cancer«nd  other  diseases.  This  approach 
to  pharmacological  intervention  should 
further  enlarge  the  already  immense 
scope  of  Ub  studies. 
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